lovE Variant Regulator Molecules 

(Atty Docket No, 109272.150; Client Docket No. MIC005US) 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention relates to the fields of microbiology 
and molecular biology. In particular, the invention 
relates to the field of mycology and the production of 
secondary metabolites from fungi. 

Summary of the Related Art 

Secondary metabolites are a major source of 
commercially useful products such as food additives, 
vitamins, and medicines for the treatment of a wide 
variety of infections and diseases. By way of example, 
in 1997 the statin drugs lovastatin, simvastatin, and 
pravastatin, fungal secondary metabolites used in the 
treatment of hypercholesteremia, together had US sales of 
US$7.53 billion (Sutherland et al . , Current Opinion In 
Drug Discovery & Development 4:229-236 (2001)). The cost 
and availability of these plant, bacterial and fungal 
metabolites are frequently determined by limitations 
imposed on production and purification of these compounds 
from culture. This problem is frequently exacerbated by 
the fact that these products are generally produced 
during the stationary phase of bacterial and fungal 
growth. 

A wide variety of methods have been utilized to 
increase the amount of secondary metabolite produced in 
culture. Studies have demonstrated the importance of 
carefully designing the medium in which a fungus is grown 
to maximize the amount of a secondary metabolite produced 
(see, e.g., Hajjaj H, et al . , Appl . Environ. Microbiol. 
67:2596-602 (2001); Lesova, K. , et al . , J . Basic 
Microbiol. 40:369-75 (2000)). In addition, the method of 
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culture or fermentation also impacts directly on the 
amount of secondary metabolite produced. For example, 
see Robinson, T\ , et al . (Appl . Microbiol. Biotechnol . 
55:284-289 (2001)), which demonstrates the advantages of 
solid state (substrate) fermentation. 

In addition to the manipulation of culture and media 
conditions, genetic approaches have been taken to 
increase secondary metabolite production. For example, 
the production of penicillin is limited by the activity 
of two enzymes, encoded by the ipnA and acvA genes, both 
of which are regulated by the pa.cC protein, a zinc-finger 
transcription factor. Naturally occurring mutant alleles 
of the pa.cC locus are known to possess more 

transcription-activating activity than the cognate, wild- 
type allele (see, e.g., Tilburn et al . EMBO J. 14(4): 779- 
790 (1995)). Thus, one genetic approach to increasing 
secondary metabolite production is to identify and 
isolate naturally occurring mutant alleles, the 
expression of which leads to increased secondary 
metabolite production . 

Although many regulators of secondary metabolite 
production in many organisms are known, not all of the 
organisms that produce secondary metabolites are amenable 
to genetic or molecular genetic manipulation. Thus, 
these systems are not generally useful as a source for 
the isolation of naturally occurring mutant alleles and 
are even less useful for the deliberate manipulation of 
secondary metabolite regulator protein structure with the 
aim of creating improved regulators of secondary 
metabolite production. 

It would be advantageous to have improved regulators 
of the biosynthetic enzymes responsible for secondary 
metabolite production. For example, recent studies 
suggest increasing usage of statin drugs, e.g., see 
Waters D.D., Am. J. Cardiol. 88:10F-5F (2001)). Thus, 
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5 demand for statin drugs is likely to increase 

substantially. In order to meet the demand for these and 
other secondary metabolites, new and improved methods for 
the production of secondary metabolites must be 
identified . 
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5 BRIEF SUMMARY OF THE INVENTION 

The invention provides improved secondary metabolite 
regulator proteins that enable increased production of 
secondary metabolites. The invention also provides 
methods to make these improved regulator proteins. 

10 In a first aspect, the invention provides a variant 

regulator protein of secondary metabolite production with 
increased activity than that of the cognate, wild- type 
protein. In certain embodiments of this aspect of the 
invention, the regulator protein is a fungal regulator 

15 protein. 

In an embodiment of the first aspect, the invention 
provides an improved regulator protein comprising an 
amino acid sequence coding for a variant lovE protein 
having at least one specific mutation that gives rise to 
20 greater transcription-activating properties of the 
regulator protein and/or induction of secondary 
metabolite synthesis . 

By way of non- limiting example, certain preferred 
regulator proteins of this aspect of the invention 
25 include at least one of the following mutations: (1) a 
Group 6 amino acid residue mutated to a Group ,2 amino 
acid residue at position 31, in one embodiment the 
mutation represented by F31L; (2) a Group 3 amino acid 
residue mutated to a Group 5 amino acid residue at 
30 position 41, in one embodiment the mutation represented 

by Q41K or Q41R; (3) a Group 4 amino acid residue mutated 
to a Group 2 amino acid residue at position 52, in one 
embodiment the mutation represented by T52I; (4) a Group 
4 amino acid residue mutated to a Group 3 amino acid 
35 residue at position 52, in one embodiment the mutation 
represented by T52N; (5) a Group 4 amino acid residue 
mutated to a Group 5 amino acid residue at position 73, 
in one embodiment the mutation represented by C73R; (6) a 
Group 1 amino acid residue mutated to a Group 4 amino 
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acid residue at position 101 , in one embodiment the 
mutation represented by PI 0 IS; (7) a Group 1 amino acid 
residue mutated to a Group 3 amino acid residue at 
position 101, in one embodiment the mutation represented 
by P101Q; (8) a valine amino acid residue mutated to 
another Group 2 amino acid residue at position 111, in 
one embodiment the mutation represented by Villi; (9) a 
Group 4 amino acid residue mutated to a Group 2 amino 
acid residue at position 133, in one embodiment the 
mutation represented by S133L; (10) a Group 3 amino acid 
residue mutated to a Group 2 amino acid residue at 
position 141, in one embodiment the mutation represented 
by E141V; (11) a Group 3 amino acid residue mutated to a 
Group 5 amino acid residue at position 141, in one 
embodiment the mutation represented by E141K; (12) a 
Group 4 amino acid residue mutated to Group 6 amino acid 
residue at position 153, in one embodiment the mutation 
represented by C153Y; (13) a Group 4 amino acid residue 
mutated to a Group 5 amino acid residue at position 153, 
in one embodiment the mutation represented by C153R; (14) 
a Group 4 amino acid residue mutated to a Group 1 amino 
acid residue at position 281, in one embodiment the 
mutation represented by T281A; (15) a Group 3 amino acid 
residue mutated to a Group 2 amino acid residue at 
position 3 67, in one embodiment the mutation represented 
by N367I; (16) a Group 3 amino acid residue mutated to a 
Group 6 amino acid residue at position 367, in one 
embodiment the mutation represented by N367Y; (17) a 
Group 1 amino acid residue mutated to Group 4 amino acid 
residue at position 389, in one embodiment the mutation 
represented by P3 8 9S; and (18) a Group 1 amino acid 
residue mutated to a Group 2 amino acid residue at 
position 3 89, in one embodiment the mutation represented 
by P389L. 
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5 In some embodiments of the first aspect, the 

invention provides regulator proteins with at least two, 
or at least three, or at least four, or at least five, or 
at least six, or at least seven, or at least eight, or at 
least nine, or at least ten, or at least eleven, or at 

10 least twelve, or at least thirteen, or at least fourteen, 
or at least fifteen, or at least sixteen, or at least 
seventeen, or at least eighteen of the above described 
specific mutations. 

In other embodiments of the first aspect, the 

15 invention provides an isolated lovE variant regulator 
protein selected from the group consisting of SEQ ID 
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In a second aspect, the invention provides a nucleic 
25 acid molecule encoding a lovE regulator of the first 

aspect of the invention. By way of non- limiting example, 
the invention provides a nucleic acid molecule encoding 
the lovE variant regulator protein selected from the 
group consisting of SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID 
30 NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 71, SEQ ID 
NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID 
NO: 76, SEQ ID NO: 78, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID 
NO: 82, SEQ ID NO: 83, SEQ ID NO : 84 , SEQ ID NO: 86, SEQ ID 
NO: 87, SEQ ID NO: 88, SEQ ID NO: 89, and SEQ ID NO: 90. 
35 In a third aspect, the invention provides a method 

of increasing the activity of a protein that regulates 
secondary metabolite production comprising: (a) selecting 
a nucleic acid comprising a polynucleotide encoding a 
protein regulator of secondary metabolite production; (b) 
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mutating the nucleic acid to create a plurality of 
nucleic acid molecules encoding variant regulator 
proteins of secondary metabolite production; and (c) 
selecting a variant regulator protein with more activity 
than the cognate, wild- type protein. 

In various embodiments of the third aspect, the 
secondary metabolite is a fungal secondary metabolite. 
In certain embodiments of the third aspect, the protein 
regulator of secondary metabolite production is a 
transcription factor. In certain embodiments of the third 
aspect, the protein regulator of secondary metabolite 
production is a transmembrane transporter, protein that 
mediates secretion, kinase, G-protein, cell surface 
receptor, GTPase activating protein, guanine nucleotide 
exchange factor, phosphatase, protease, 

phosphodiesterase, bacterial protein toxin, importin, 
RNA-binding protein, SCF complex component, adherin, or 
protein encoded within a biosynthetic cluster. In certain 
other embodiments of the third aspect, the variant 
regulator protein is selected to have more activity in a 
heterologous cell and/or more activity in a homologous 
cell than the cognate, wild-type regulator protein. In 
certain embodiments, the variant regulator protein is 
selected to have more activity in a heterologous cell 
and/or more activity in a homologous cell than the 
cognate, wild- type protein and to cause more secondary 
metabolite to be produced in a homologous cell and/or a 
heterologous cell when compared to the cognate, wild-type 
regulator protein. In a particularly preferred 
embodiment, the variant regulator protein is a lovE 
variant regulator protein. 

In a fourth aspect, the invention provides a method 
of increasing production of a secondary metabolite 
comprising: (a) selecting a nucleic acid comprising a 
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polynucleotide encoding a protein regulator of secondary 
metabolite production; (b) mutating the nucleic acid to 
create a plurality of nucleic acid molecules encoding 
variant regulator proteins of secondary metabolite 
production; (c) selecting a variant regulator protein 
with more activity than the cognate, wild- type protein; 
and (d) expressing the selected variant regulator protein 
in a cell, thereby increasing production of the secondary 
metabolite in the cell. 

In various embodiments of the fourth aspect, the 
secondary metabolite is a fungal secondary metabolite. In 
certain embodiments of the third aspect, the protein 
regulator of secondary metabolite production is a 
transcription factor. In certain embodiments of the 
fourth aspect, the protein regulator of secondary 
metabolite production is a transmembrane transporter, a 
protein that mediates secretion, a kinase, a G-protein, a 
cell surface receptor, a GTPase activating protein, a 
guanine nucleotide exchange factor, a phosphatase, a 
protease, a phosphodiesterase, a bacterial protein toxin, 
an importin, an RNA-binding protein, an SCF complex 
component, an adherin, or a protein encoded within a 
biosynthetic cluster. In certain other embodiments of 
the fourth aspect, the variant regulator protein is 
selected to have more activity in a heterologous cell 
and/or more activity in a homologous cell. In certain 
embodiments, the variant regulator protein is selected to 
have more activity in a heterologous cell and/or more 
activity in a homologous cell and to cause more secondary 
metabolite to be produced in a homologous cell and/or a 
heterologous cell when compared to the cognate, wild- type 
regulator protein. In a particularly preferred 
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embodiment, the variant regulator protein is a lovE 
variant regulator protein. 

In a fifth aspect, the invention provides an 
isolated variant regulator protein of secondary 
metabolite production having increased activity compared 
to a cognate, wild- type protein, the variant regulator 
protein made by the process comprising: (a) selecting a 
nucleic acid comprising a polynucleotide encoding a 
protein regulator of secondary metabolite production; (b) 
mutating the nucleic acid to create a plurality of 
nucleic acid molecules encoding variant regulator 
proteins of secondary metabolite production; (c) 
selecting a variant regulator protein with more activity 
than the cognate, wild-type protein; and (d) recovering 
the selected variant regulator protein. 

In certain embodiments of the fifth aspect, the 
secondary metabolite is a fungal secondary metabolite. 
In certain embodiments of the fifth aspect, the protein 
regulator of secondary metabolite production is a 
transcription factor. In certain embodiments of the fifth 
aspect, the protein regulator of secondary metabolite 
production is a transmembrane transporter, a protein that 
mediates secretion, a kinase, a G-protein, a cell surface 
receptor, a GTPase activating protein, a guanine 
nucleotide exchange factor, a phosphatase, a protease, a 
phosphodiesterase, a bacterial protein toxin, an 
import in, an RNA-binding protein, an SCF complex 
component, an adherin, or a protein encoded within a 
biosynthetic cluster. .In certain embodiments of the 

fifth aspect, the variant regulator protein has more 
activity in a heterologous and/or a homologous cell than 
the cognate, wild- type protein. In certain embodiments of 
the fourth aspect, the variant regulator protein 


-9- 


BOSTON 1322173vl 


5 increases production of a secondary metabolite in a 

heterologous cell and/or a homologous cell when compared 
to the cognate, wild- type protein. In a particularly 
preferred embodiment, the variant regulator protein is a 
lovE variant regulator protein. 

10 In a sixth aspect, the invention provides a fungus 

having improved lovastatin production made by the process 
of transforming a fungal cell with a nucleic acid 
molecule encoding a lovE variant protein of the first 
aspect of the invention. In an embodiment thereof, the 

15 nucleic acid molecule is selected from a nucleic acid 
molecule of the second aspect of the invention. 

In a seventh aspect, the invention provides an 
improved process for making lovastatin comprising 
transforming a fungal cell with a nucleic acid molecule 

20 encoding a variant of the lovE protein of the first 

aspect of the invention. In an embodiment thereof, the 
fungal cell is transformed with a nucleic acid molecule 
of the second aspect of the invention. 

In a eighth aspect, the invention provides a nucleic 

25 acid molecule encoding a lovE protein defined by SEQ ID 
NO: 91. In an embodiment thereof, the invention provides 
an isolated lov£ nucleic acid molecule defined by SEQ ID 
NO: 92. 
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5 


BRIEF DESCRIPTION OF THE DRAWINGS 


Figure 1 is a photographic representation of cells 
growing on media with and without G418 selection 
10 demonstrating 2ovFp-HJS3p-Neo activation in S. 

cerevisiae . Controls include MB968 (vector only) , MB2478 
(lowly expressed wild-type lovE) , and MB1644 (highly 
expressed wild-type lovE) . All lovE variants are 
expressed in an MB968 vector backbone similar to MB2478. 

15 

Figure 2A is a graphic representation of lovFp- 
CYClp- la.cZ expression in S. cerevisiae strains expressing 
lovE variant proteins from the clones lovE 1-10. 

20 Figure 2B is a graphic representation of lovFp-CYClp-la.cZ 
expression in S. cerevisiae strains expressing lovE 
variant proteins from the clones lovE 1-10 from a 
separate transformation than that of Figure 2A. 

Figure 3 is a graphic presentation of lovFp-CYClp- 

25 lacZ expression in S. cerevisiae strains expressing lovE 
variant proteins from clones lovE 16-41. 

Figure 4 is a graphic presentation of lovFp-lacZ 
expression in S. cerevisiae strains expressing lovE 
30 variant proteins from clones lovE 1-10. 

Figure 5 is a graphic presentation of lovFp-lacZ 
expression in S. cerevisiae strains expressing lovE 
variant proteins from clones lovE 16, 20, 21, 30-34, and 
35 36-41. 

Figure 6 is a graphic presentation of lovastatin 
culture concentration, as measured by enzyme inhibition 


BOSTON 1322173vl 


-11- 


5 assay, from broths of A. terreus cultures expressing lovE 
variant proteins 1-10 in. 


Figure 7A is a graphic depiction of lovastatin 
culture concentration, as measured by HPLC analysis, from 
10 broths of A. terreus cultures expressing lovE variant 
proteins 1-10 in MF117. 

Figure 7B is a graphic depiction of lovastatin 
culture concentration, as measured by HPLC analysis, from 
15 broths of A . terreus cultures expressing lovE variant 
proteins 2, 6, 30, 32, 36, 37, 39, and 41 in MF117 . 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 


The patents and publications cited herein reflect 
the level of knowledge in the art and are hereby 
incorporated by reference in their entirety. Any 
conflict between any teaching of such references and this 
specification shall be resolved in favor of the latter. 

The invention utilizes techniques and methods common 
to the fields of molecular biology, genetics and 
microbiology. Useful laboratory references for these 
types of methodologies are readily available to those 
skilled in the art. See, for example, Molecular Cloning, 
A Laboratory Manual , 3 rd edition, edited by Sambrook, J., 
MacCallum, P., and Russell, D.W. (2001), Cold Spring 
Harbor Laboratory Press (ISBN: 0-879-69576-5) ; Current 
Protocols In Molecular Biology , edited by Ausubel, F.M., 
Brent, R., Kingston, R.E., Moore, D.D., Seidman, J.G., 
Struhl, K. (1993) , John Wiley and Sons, Inc. (ISBN: 0- 
471-30661-4) ; PCR Applications: Protocols for Functional 
Genomics , edited by Innis, M.A., Gelfand, D.H., Sninsky, 
J.J. (1999) , Cold Spring Harbor Press (ISBN: 0-123-72186- 
5) ; and Methods In Yeast Genetics , 2000 Edition: A Cold 
Spring Habor Laboratory Course Manual, by Burke, D., 
Dawson, D. and Stearns, T. , Cold Spring Harbor Press 
(ISBN: 0-879-69588-9) . 

In certain embodiments of the aspects of the 
invention, the invention relates to the biosynthesis and 
improved production of secondary metabolites. The 
invention provides variant regulator proteins useful for 
the production of secondary metabolites, nucleic acid 
molecules encoding variant regulator proteins, and 
methods for their production. 

In a first aspect, the invention provides a variant 
regulator protein of secondary metabolite production with 
increased activity relative to a cognate, wild- type 
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5 regulator protein. Particularly preferred are variant 
regulator proteins of fungal secondary metabolites. 

As used herein, the terms "fungal 11 and "fungus" 
refer generally to eukaryotic, heterotrophic organisms 
with an absorptive mode of nutrition. Fungi typically 
10 contain chitin in their cell walls and exhibit mycelial 
or yeast -like growth habits ( More Gene Manipulations in 
Fungi , edited by J.W. Bennet and L.L. Lasure, Academic 
Press Inc. (1991), ISBN 0120886421), More specifically, 
the terms refer to secondary metabolite producing 
15 organisms including, without limitation, Aspergillus sp. , 
Penicillium sp . , Acremoni um chrysogenum, Yarrowia 
lipolytica, Nodullsporlum sp . , Fusarium sp . , Monascus 
sp., Claviceps sp . , Trichoderma sp. , Tolypocladium sp. r 
Tricotheicium sp. , Fusidium sp. , Emericellopsis sp . , 
20 Cephalosporium sp . , Cochliobolus sp., Helminthosporium 
sp. , Agaricus brunescens , Ustilago maydis, Neurospora 
sp., Pestalotiopsis sp. and Phaffia rhodozyma (See, 
Funeral Physiol oqv . Chapter 9 (Secondary (Special) 
Metabolism), Griffin, D. H., John Wiley & Sons, Inc . ; 
25 ISBN: 0471166154) . 

The term "variant regulator protein" is used herein 
to refer to any regulatory protein having at least one 
change or difference in the amino acid sequence of the 
protein when compared to its cognate, wild- type 
30 regulatory protein sequence. The term does not include 
naturally occurring allelic variations of the cognate, 
wild- type regulatory protein. 

The term "regulator protein" is meant to refer to a 
protein having a positive or negative function that 
35 modifies the production of a secondary metabolite. The 
function of the protein may be at the level of 
transcription, e.g., repression or activation, protein 
synthesis, or transport. The regulator may alter the 
level of transcription, RNA stability, translation, post- 
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translational modification, or cellular localization of 
proteins involved in secondary metabolite synthesis 
and/or transport. The regulator may also have effects on 
precursor metabolite pools, flux through specific 
pathways and metabolite resistance. 

By way of non- limiting example, certain embodiments 
of the aspects of the invention relate to a regulator 
protein that is a protein that contributes and/or 
promotes transcription of a gene sequence, i.e., a 
transcription-activating protein. "Transcription- 
activating" is a term used to refer to characteristics of 
a protein that promote transcription. As used herein, a 
transcription-activating protein would include proteins 
that increase accessibility of the DNA to transcription 
complexes, for example, by opening or relaxing chromatin 
structure, proteins that promote the recognition and/or 
binding of transcription complexes to a target gene 
sequence, and/or proteins that promote transcription 
complex movement along the length of the template DNA 
sequence . 

Regulatory proteins of secondary metabolite 
production and the nucleic acid sequences encoding these 
are known to those skilled in the art. Non-limiting 
examples of regulatory proteins of secondary metabolite 
synthesis include: regulator proteins of the 
af latoxin/sterigmatocystin biosynthetic cluster 
(Woloshuk, CP., et al . , Appl , Environ. Microbiol. 
60:2408-2414 (1994) and Brown, D.W., et al . , Proc Natl 
Acad Sci USA. 93:1418-1422 (1996)); regulator proteins 
of the paxilline biosynthetic cluster (Young, C, et al . , 
Mol, Microbiol. 39:754-764 (2001)); regulator proteins of 
the cephalosporin and penicillin biosynthetic clusters 
(Litzka O., et al., Antonie Van Leeuwenhoek 75:95-105 
(1999); Schmitt E.K. and Kuck U. , J. Biol. Chem. 
275:9348-9357 (2000); MacCabe et al . Mol. Gen. Genet. 
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5 250:367-374 (1996); Suarez et al . Mol. Microbiol. 
20:529-540 (1996); Lambert et al . Mol. Cell. Biol. 
17:3966-3976 (1997); Su et al . Genetics 133:67-77 (1993); 
regulator proteins of tricothecene synthesis (Trapp S.C., 
et al., Mol. Gen. Genet. 257:421-432 (1998); Brown D.W., 

10 et al., Fungal Genet. Biol. 32:121-133 (2001); and 
Matsumoto G., et al . Biosci. Biotechnol . Biochem. 
63:2001-2004 (1999)); and regulator proteins of 
lovastatin synthesis (Kennedy, J., et al . , Science 
284:1368-1372 (1999); Hendrickson et al . , Chem. Biol. 

15 6:429-439 (1999) Tag, A. et al . , Mol Microbiol. 38:658-65 
(2000) ) . 

Certain embodiments of the aspects of the invention 
disclosed herein relate to the lovE regulator protein, a 
protein which plays a key role in the biosynthesis of 

20 lovastatin. More particularly, certain embodiments of 
the aspects of the invention relate to variant proteins 
of the lovE regulator protein and methods of making the 
same. Such proteins are variant with respect to the 
following A. terreus wild- type lovE sequences (SEQ ID 

25 NOS:91 and 92) . 


Table 1; Amino Acid and Nucleic Acid Sequences of Wild- ty pe lovE 
Wild- type lovE Amino Acid Sequence 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrc 
qqaglrcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssnts 
rqf ldppdsydwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekap 
lppvssdiaraasaqrelf ddlsavsqeleeillavtvewpkqeiwthpigmf fnasrrl 
Itvlrqqaqadchqgtldeclrtknlf tavhcyilnvriltaiselllsqirrtqnshms 
plegsrsqspsrddtssssghssvdtipf f senlpigelf syvdplthalf sacttlhvg 
vqllreneitlgvhsaqgiaasismsgepgediartgatnsarceeqpttpaarvlfmf 1 
sdegafqeaksagsrgrtiaalrrcyedif slarkhkhgmlrdlnnipp (SEQ ID 

NO: 91) _ 

Wild- type lovE DNA Sequence 

atggctgcagatcaaggtatattcacgaactcggtcactctctcgccagtggagggttca 
cgcaccggtggaacattaccccgccgtgcattccgacgctcttgtgatcggtgtcatgca 
caaaagatcaaatgtactggaaataaggaggttactggccgtgctccctgtcagcgttgc 
cagcaggctggacttcgatgcgtctacagtgagcgatgccccaagcgcaagctacgccaa 
tccagggcagcggatctcgtctctgctgacccagatccctgcttgcacatgtcctcgcct 
ccagtgccctcacagagcttgccgctagacgtatccgagtcgcattcctcaaatacctcc 
cqqcaatttcttgatccaccqqacaqctacgactqqtcqtgqacctcq attqqcactgac 
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gaggc tat tgacactgactgctgggggctgtcccaatgtgatggaggctt cage tgtcag 
ttagagccaacgctgccggatctaccttcgcccttcgagtctacggttgaaaaagctccg 
ttgccaccggtatcgagcgacattgctcgtgcggccagtgcgcaacgagagcttttcgat 
gacctgtcggcggtgtcgcaggaactggaagagatccttctggccgtgacggtagaatgg 
ccgaagcaggaaatctggacccatcccatcggaatgtttttcaatgcgtcacgacggctt 
cttactgtcctgcgccaacaagcgcaggccgactgccatcaaggcacactagacgaatgt 
ttacggaccaagaacctctttacggcagtacactgttacatattgaatgtgcggattttg 
accgccatatcggagttgctcctgtcgcaaattaggcggacccagaacagccatatgagc 
ccactggaagggagtcgatcccagtcgccgagcagagacgacaccagcagcagcagcggc 
cacagcagtgttgacaccatacccttctttagcgagaacctccctattggtgagctgttc 
tcctatgttgaccccctgacacacgccctattctcggcttgcactacgttacatgttggg 
gtacaattgctgcgtgagaatgagattactctgggagtacactccgcccagggcattgca 
gcttccatcagcatgagcggggaaccaggcgaggatatagccaggacaggggcgaccaat 
tccgcaagatgcgaggagcagccgaccactccagcggctcgggttttgttcatgttcttg 
agtgatgaaggggctttccaggaggcaaagtctgctggttcccgaggtcgaaccatcgca 
gcactgcgacgatgctatgaggatatcttttccctcgcccgcaaacacaaacatggcatg 
ctcaqaqacctcaacaatattcctccatqa (SEQ ID NO: 92) 

5 

As used herein, the term "secondary metabolite" 
means a compound, derived from primary metabolites, that 
is produced by an organism, is not a primary metabolite, 
is not ethanol or a fusel alcohol, and is not required 
10 for growth under standard conditions. Secondary 

metabolites are derived from intermediates of many 
pathways of primary metabolism. These pathways include, 
without limitation, pathways for biosynthesis of amino 
acids, the shikimic acid pathway for biosynthesis of 
15 aromatic amino acids, the polyketide biosynthetic pathway 
from acetyl coenzyme A (CoA) , the mevalonic acid pathway 
from acetyl CoA, and pathways for biosynthesis of 
polysaccharides and peptidopolysaccharides . 
Collectively, secondary metabolism involves all primary 
20 pathways of carbon metabolism. Particularly preferred in 
embodiments of the aspects of the invention are fungal 
secondary metabolites (See, Fungal P hysiology, Chapter 9 
(Secondary (Special) Metabolism), Griffin, D. H., John 
Wiley & Sons, Inc.; ISBN: 0471166154). 
25 "Secondary metabolite" also includes intermediate 

compounds in the biosynthetic pathway for a secondary 
metabolite that are dedicated to the pathway for 
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synthesis of the secondary metabolite. "Dedicated to the 
pathway for synthesis of the secondary metabolite" means 
that once the intermediate is synthesized by the cell, 
the cell will not convert the intermediate to a primary 
metabolite. "Intermediate compounds" also include 
secondary metabolite intermediate compounds which can be 
converted to useful compounds by subsequent chemical 
conversion or subsequent biotransformation. As such, 
providing improved availability of such intermediate 
compounds would still lead to improved production of the 
ultimate useful compound, which itself may be referred to 
herein as a secondary metabolite. The yeast 
Saccharomyces cerevisiae is not known to produce 
secondary metabolites. 

The term "primary metabolite" means a natural 
product that has an obvious role in the functioning of 
almost all organisms. Primary metabolites include, 
without limitation, compounds involved in the 
biosynthesis of lipids, carbohydrates, proteins, and 
nucleic acids. The term "increasing the yield of the 
secondary metabolite" means increasing the quantity of 
the secondary metabolite present in the total 
fermentation broth per unit volume of fermentation broth 
or culture. 

As used herein, the phrase "modulate production of a 
secondary metabolite" refers to a positive or negative or 
desirable change in one or more of the variables or 
values that affect the process or results of production 
of the primary or secondary metabolites in a liquid or 
solid state fungal fermentation. These positive or 
negative or desirable changes include, without 
limitation, an increase or decrease in the amount of a 
primary or secondary metabolite being produced (in 
absolute terms or in quantity per unit volume of 
fermentation broth or per unit mass of solid substrate) ; 
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a decrease in the volume of the broth or the 
mass/quantity of substrate required for the production of 
sufficient quantities; a decrease in the cost of raw 
materials and energy, the time of fermentor or culture 
run, or the amount of waste that must be processed after 
a fermentor run; an increase or decrease in the specific 
production of the desired metabolite (both in total 
amounts and as a fraction of all metabolites and side 
products made by the fungus) ; an increase or decrease in 
the percent of the produced secondary metabolite that can 
be recovered from the fermentation broth or culture; and 
an increase in the resistance of an organism producing a 
primary or secondary metabolite to possible deleterious 
effects of contact with the secondary metabolite. 

In certain embodiments of aspects of the invention, 
a secondary metabolite is an anti-bacterial. An "anti- 
bacterial" is a molecule that has cytocidal or cytostatic 
activity against some or all bacteria. Preferred anti- 
bacterials include, without limitation, p-lactams. 
Preferred p-lactams include, without limitation, 
penicillins and cephalosporins and biosynthetic 
intermediates thereof. Preferred penicillins and 
biosynthetic intermediates include, without limitation, 
isopenicillin N, 6-aminopenicillanic acid (6-APA) , 
penicillin G, penicillin N, and penicillin V. Preferred 
cephalosporins and biosynthetic intermediates include, 
without limitation, deacetoxycephalosporin V (DAOC V) , 
deacetoxycephalosporin C (DAOC) , deacetylcephalosporin C 
(DAC) , 7-aminodeacetoxycephalosporanic acid (7-ADCA) , 
cephalosporin C, 7-B- ( 5 - carboxy- 5 - oxopentanamido) - 
cephalosporanic acid (keto-AD- 7ACA) , 7-B -(4- 
carboxybutanamido) -cephalosporanic acid (GL-7ACA) , and 7- 
aminocephalosporanic acid (7ACA) . 
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In certain embodiments of aspects of the invention, 
the secondary metabolite is an anti-hypercholesterolemic 
or a biosynthetic intermediate thereof. An "anti- 
hypercholesterolemic" is a drug administered to a patient 
diagnosed with elevated cholesterol levels for the 
purpose of lowering the cholesterol levels. Preferred 
anti-hypercholesterolemics include, without limitation, 
lovastatin, mevastatin, simvastatin, and pravastatin. 

According to other embodiments of the invention, a 
secondary metabolite is an immunosuppressant or a 
biosynthetic intermediate thereof. An 
"immunosuppressant" is a molecule that reduces or 
eliminates an immune response in a host when the host is 
challenged with an immunogenic molecule, including 
immunogenic molecules present on transplanted organs, 
tissues or cells. Preferred immunosuppressants include, 
without limitation, members of the cyclosporin family and 
beauverolide L. Preferred cyclosporins include, without 
limitation, cyclosporin A and cyclosporin C. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is an ergot alkaloid or a 
biosynthetic intermediate thereof. An "ergot alkaloid" 
is a member of a large family of alkaloid compounds that 
are most often produced in the sclerotia of fungi of the 
genus Claviceps . An "alkaloid" is a small molecule that 
contains nitrogen and has basic pH characteristics. The 
classes of ergot alkaloids include clavine alkaloids, 
lysergic acids, lysergic acid amides, and ergot peptide 
alkaloids. Preferred ergot alkaloids include, without 
limitation, ergotamine, ergosine, ergocristine , 
ergocrypt ine , ergocornine , ergotaminine , ergosinine , 
ergocristinine , ergocrypt inine , ergocorninine , 
ergonovine, ergometr inine , and ergoclavine. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is an inhibitor of angiogenesis 
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or a biosynthetic intermediate thereof. An "angiogenesis 
inhibitor" is a molecule that decreases or prevents the 
formation of new blood vessels. Angiogenesis inhibitors 
have proven effective in the treatment of several human 
diseases including, without limitation, cancer, 
rheumatoid arthritis, and diabetic retinopathy. 
Preferred inhibitors of angiogenesis include, without 
limitation, fumagillin and ovalicin. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is a glucan synthase inhibitor 
or a biosynthetic intermediate thereof. A "glucan 
synthase inhibitor" is a molecule that decreases or 
inhibits the production of 1 , 3 -p-D-glucan, a structural 
polymer of fungal cell walls. Glucan synthase inhibitors 
are a class of antifungal agents. Preferred glucan 
synthase inhibitors include, without limitation, 
echinocandin B, pneumocandin B, aculeacin A, and 
papulacandin . 

In certain embodiments of aspects of the invention, 
the secondary metabolite is a member of the gliotoxin 
family of compounds or a biosynthetic intermediate 
thereof. The "gliotoxin family of compounds" are related 
molecules of the epipolythiodioxopiperazine class. 
Gliotoxins display diverse biological activities, 
including, without limitation, antimicrobial, antifungal, 
antiviral, and immunomodulat ing activities. Preferred 
members of the "gliotoxin family of compounds" include, 
without limitation, gliotoxin and aspirochlorine . 

In certain embodiments of aspects of the invention, 
the secondary metabolite is a fungal toxin or a 
biosynthetic intermediate thereof. A "fungal toxin" is a 
compound that causes a pathological condition in a host, 
either plant or animal . Fungal toxins could be 
mycotoxins present in food products, toxins produced by 
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phytopathogens , toxins from poisonous mushrooms, or 
toxins produced by zoopathogens . Preferred fungal toxins 
include, without limitation, aflatoxins, patulin, 
zearalenone , cytochalasin , griseof ulvin, ergochrome, 
cercosporin, marticin, xanthocillin, coumarins, 
tricothecenes , fusidanes, sesterpenes, amatoxins, 
malformin A, phallotoxins , pentoxin, HC toxin, 
psilocybin, bufotenine, lysergic acid, sporodesmin, 
pulcheriminic acid, sordarins, fumonisins, ochratoxin A, 
and fusaric acid. 

With some certain embodiments of aspects of the 
invention, the secondary metabolite is a modulator of 
cell surface receptor signaling or a biosynthetic 
intermediate thereof. The term "cell surface receptor" 
is as used before. Modulators of cell surface receptor 
signaling might function by one of several mechanisms 
including, without limitation, acting as agonists or 
antagonists, sequestering a molecule that interacts with 
a receptor such as a ligand, or stabilizing the 
interaction of a receptor and molecule with which it 
interacts. Preferred modulators of cell surface 
signaling include, without limitation, the insulin 
receptor agonist L-783,281 and the cholecystokinin 
receptor antagonist asperlicin. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is a plant growth regulator or a 
biosynthetic intermediate thereof. A "plant growth 
regulator" is a molecule that controls growth and 
development of a plant by affecting processes that 
include, without limitation, division, elongation, and 
differentiation of cells. Preferred plant growth 
regulators include, without limitation, cytokinin, auxin, 
gibberellin, abscisic acid, and ethylene. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is a pigment or a biosynthetic 
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intermediate thereof. A "pigment" is a substance that 
imparts a characteristic color. Preferred pigments 
include, without limitation, melanins and carotenoids. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is an insecticide or a 
biosynthetic intermediate thereof. An "insecticide" is a 
molecule that is toxic to insects. Preferred 
insecticides include, without limitation, nodulisporic 
acid. 

In certain embodiments of aspects of the invention, 
the secondary metabolite is an anti -neoplastic compound 
or a biosynthetic intermediate thereof. An "anti- 
neoplastic 11 compound is a molecule that prevents or 
reduces tumor formation. Preferred ant i -neoplastic 
compounds include, without limitation, taxol (paclitaxel) 
and related taxoids. 

The phrase "increased activity" is used herein to 
refer to a characteristic that results in an augmentation 
of the inherent negative or positive function of the 
regulatory protein. 

The invention provides variant regulator proteins of 
secondary metabolite production with increased activity 
and methods of producing the same. The invention further 
provides for the identification of specific amino acid 
residues that are important to the functioning of 
secondary metabolite regulator proteins. By way of non- 
limiting example, variant regulator proteins of the 
secondary metabolite regulator lovE are presented herein. 

As known to those skilled in the art, certain 
substitutions of one amino acid for another may be 
tolerated at one or more amino acid residues of a wild- 
type regulator protein absent a change in the structure, 
activity and/or function of the wild- type protein. Such 
substitutions are referred to in the art as 
"conservative" substitutions, and amino acids may be 
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categorized into groups that identify which amino acids 
may be substituted for another without altering the 
structure and/or function of the protein. 

As used herein, the term "conservative substitution" 
refers to the exchange of one amino acid for another in 
the same conservative substitution grouping in a protein 
sequence. Conservative amino acid substitutions are 
known in the art and are generally based on the relative 
similarity of the amino acid side-chain substituents , for 
example, their hydrophobicity , hydrophilicity , charge, 
size, and the like. In a preferred embodiment, 
conservative substitutions typically include 
substitutions within the following groups: Group 1: 
glycine, alanine, and proline; Group 2: valine, 
isoleucine, leucine, and methionine; Group 3: aspartic 
acid, glutamic acid, asparagine, glutamine; Group 4: 
serine, threonine, and cysteine; Group 5: lysine, 
arginine, and histidine; Group 6: phenylalanine, 
tyrosine, and tryptophan. Each group provides a listing 
of amino acids that may be substituted in a protein 
sequence for any one of the other amino acids in that 
particular group. 

As stated supra, there are several criteria used to 
establish groupings of amino acids for conservative 
substitution. For example, the importance of the 
hydropathic amino acid index in conferring interactive 
biological function on a protein is generally understood 
in the art (Kyte and Doolittle, Mol . Biol. 157:105-132 
(1982) . It is known that certain amino acids may be 
substituted for other amino acids having a similar 
hydropathic index or score and still retain a similar 
biological activity. Amino acid hydrophilicity is also 
used as a criteria for the establishment of conservative 
amino acid groupings (see, e.g., U.S. Patent No. 
4, 554, 101) . 
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Information relating to the substitution of one 
amino acid for another is generally known in the art 
(see, e.g. , Introduction to Protein Architecture : The 
Structural Biology of Proteins , Lesk, A.M., Oxford 
University Press; ISBN: 0198504748; Introduction to 
Protein Structure , Branden, C.-I., Tooze, J., Karolinska 
Institute, Stockholm, Sweden (January 15, 1999) ; and 
Protein Structure Prediction: Methods and Protocols 
(Methods in Molecular Biology) , Webster, D.M. (Editor) , 
August 2000, Humana Press, ISBN: 0896036375) . 

In one embodiment of the first aspect, the invention 
provides an improved regulator protein comprising an 
amino acid sequence coding for a variant of the lovE 
protein having at least one specific mutation that gives 
rise to greater transcription-activating properties of 
the regulator protein and/or increased lovastatin 
synthesis . 

By way of non-limiting example, certain amino acid 
residues and mutations thereof in the lovE regulatory 
protein of A . terreus (SEQ ID NO: 91) are identified by 
the invention described herein. Mutations at residues 
31, 41, 52, 73, 101, 111, 133, 141, 153, 281, 367, and 
3 89 of the wild- type lovE protein of A. terreus have been 
identified as being critical for the improvement of lovE 
regulator protein function. Those mutations include: 
F31L, Q41K, Q41R, T52I, T52N, C73R, P101S, P101Q, Villi, 
S133L, E141V, E141K, C153Y, C153R, T281A, N367I, N367Y, 
P389S and P389L. Each mutation, therefore, represents a 
change of one: conservative class of amino acids for 
another. For example, the mutation F31L represents a 
change from a Group 6 amino acid residue to a Group 2 
amino acid residue at position 31 of the wild- type, lovE 
regulator protein. 

Thus, by way of non-limiting example, regulator 
proteins of this aspect of the invention include at least 
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5 one of the following mutations: (1) a Group 6 amino acid 
residue mutated to a Group 2 amino acid residue at 
position 31, for example, the mutation represented by 
F31L; (2) a Group 3 amino acid residue mutated to a Group 
5 amino acid residue at position 41, for example, the 
' 10 mutation represented by Q41K or Q41R; (3) a Group 4 amino 
acid residue mutated to a Group 2 amino acid residue at 
position 52, for example, the mutation represented by 
T52I; (4) a Group 4 amino acid residue mutated to a Group 

3 amino acid residue at position 52, for example, the 
15 mutation represented by T52N; (5) a Group 4 amino acid 

residue mutated to a Group 5 amino acid residue at 
position 73, for example, the mutation represented by 
C73R; (6) a Group 1 amino acid residue mutated to a Group 

4 amino acid residue at position 101, for example, the 
20 mutation represented by P101S; (7) a Group 1 amino acid 

residue mutated to a Group 3 amino acid residue at 
position 101, for example, the mutation represented by 
P101Q; (8) a valine amino acid residue mutated to another 
Group 2 amino acid residue at position 111, for example, 
25 the mutation represented by Villi; (9) a Group 4 amino 
acid residue mutated to a Group 2 amino acid residue at 
position 133, for example, the mutation represented by 
S133L; (10) a Group 3 amino acid residue mutated to a 
Group 2 amino acid residue at position 141, for example, 
30 the mutation represented by E141V; (11) a Group 3 amino 
acid residue mutated to a Group 5 amino acid residue at 
position 141, for example, the mutation represented by 
E141K; (12) a Group 4 amino acid residue mutated to Group 
6 amino acid residue at position 153, for example, the 
35 mutation represented by C153Y; (13) a Group 4 amino acid 
residue mutated to a Group 5 amino acid residue at 
position 153, for example, the mutation represented by 
C153R; (14) a Group 4 amino acid residue mutated to a 
Group 1 amino acid residue at position 281, for example, 


-26- 


BOSTON 1322173vl 


the mutation represented by T281A; (15) a Group 3 amino 
acid residue mutated to a Group 2 amino acid residue at 
position 367, for example, the mutation represented by 
N367I; (16) a Group 3 amino acid residue mutated to a 
Group 6 amino acid residue at position 367, for example, 
the mutation represented by N367Y; (17) a Group 1 amino 
acid residue mutated to Group 4 amino acid residue at 
position 389, for example, the mutation represented by 
P3 8 9S; and/or (18) a Group 1 amino acid residue mutated 
to a Group 2 amino acid residue at position 3 89, for 
example, the mutation represented by P3 8 9L. 

In other embodiments of the first aspect, the 
invention provides a variant of the lovE regulator 
protein with at least two, or at least three, or at least 
four, or at least five, or at least six, or at least 
seven, or at least eight, or at least nine, or at least 
ten, or at least eleven, or at least twelve, or at least 
thirteen, or at least fourteen, or at least fifteen, or 
at least sixteen, or at least seventeen, or at least 
eighteen of the above described specific mutations. 

In other embodiments of the first aspect, the 
invention provides an isolated lovE variant regulator 
protein having the sequence of SEQ ID NO: 41, SEQ ID 
NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID 
NO: 46, SEQ ID NO: 47, SEQ ID NO : 4 8 , SEQ ID NO : 4 9 , SEQ ID 
NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID 
NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID 
NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID 
NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, or SEQ ID NO: 65. 

In a second aspect, the invention provides a nucleic 
acid molecule encoding a variant regulator protein of 
secondary metabolite production of the first aspect of 
the invention. As used herein, the terms "nucleic acid" 
or "nucleic acid molecule" refer to a deoxyribonucleot ide 
or ribonucleotide polymer in either single-or double- 
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stranded form, and unless otherwise limited, would 
encompass analogs of natural nucleotides that can 
function in a similar manner as the naturally occurring 
nucleotide . 

In one embodiment of the second aspect, the 
invention provides a nucleic acid molecule encoding a 
variant protein of the lovE regulator protein of the 
first aspect of the invention. 

By way of non-limiting example, the invention 
provides a nucleic acid molecule encoding a lovE variant 
regulator protein having the sequence of SEQ ID NO: 66, 
SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, 
SEQ ID NO: 71, SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, 
SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 79, 
SEQ ID NO: 81, SEQ ID NO: 82, SEQ ID NO: 83, SEQ ID NO: 84, 
SEQ ID NO: 86, SEQ ID NO: 87, SEQ ID NO: 88, SEQ ID NO: 89, 
or SEQ ID NO: 90 . 

Poor transformation efficiency and the lack of 
efficient selection systems frequently precludes the 
screening of large numbers of variant regulator proteins 
of secondary metabolites in the organism from which the 
regulator protein is isolated. For example, there are 
currently certain technical obstacles to the successful 
screening of large numbers of variant regulator proteins 
in the fungus A. terreus, an organism that produces the 
secondary metabolite lovastatin. 

The invention described herein takes advantage of 
the genetically tractable and experimentally amenable 
organism Saacharomyces cerevisiae for screening large 
numbers of variant regulator proteins of secondary 
metabolite production. Techniques common to the field of 
molecular biology are well developed in S. cerevisiae, 
and large numbers of vectors are available to assist the 
genetic manipulation and cloning of variant regulator 
proteins involved in secondary metabolite production. 
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Other genetically tractable organisms could also be used 
for this purpose. 

In a third aspect, the invention provides a method 
of increasing the activity of a protein that regulates 
secondary metabolite production comprising: (a) selecting 
a nucleic acid comprising a polynucleotide encoding a 
protein regulator of secondary metabolite production; (b) 
mutating the nucleic acid to create a plurality of 
nucleic acid molecules encoding variant regulator 
proteins of secondary metabolite production; and (c) 
selecting a variant regulator protein with more activity 
than the cognate, wild-type protein. 

As used herein, "mutating" is used to refer to the 
deliberate alteration of at least one nucleotide residue 
of a wild- type, cognate nucleic acid sequence encoding a 
regulator protein of secondary metabolite production. A 
deliberate alteration or change in at least one 
nucleotide residue of a polynucleotide may be 
accomplished by any method known in the art. The 
mutation (s) can be made in vivo or in vitro and can 
include random, partially random or not random, i.e., 
directed, mutagenesis techniques. 

By way of non- limiting example, in vivo mutagenesis 
can be done by placing this nucleic acid molecule in a 
cell with a high mutation frequency, i.e. a mutagenic 
strain. By way of non-limiting example, Muhlrad et al . 
(Yeast 8:79-82 (1992)) have developed a rapid method for 
localized mutagenesis of yeast genes. As a first step, 
the region of interest of a gene sequence is first 
amplified in vitro under error-prone polymerase chain 
reaction (PCR) conditions. Error-prone polymerase chain 
reaction (PCR) is a method of introducing amino acid 
changes into proteins. With this technique, mutations 
are deliberately introduced during the PCR reaction 
through the use of error-prone DNA polymerases under 
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specific reaction conditions. With the Muhlrad et al . 
procedure, the PCR product is then co- transformed with a 
gapped plasmid containing homology to both ends of the 
PCR product, resulting in in vivo recombination to repair 
the gap with the mutagenized DNA . 

There are a variety of commercially available kits 
that may be used to produce mutant nucleic acid molecules 
by error-prone PCR (see, e.g., GeneMorph™ PCR Mutagenesis 
Kit (Stratagene, La Jolla, California) ; and Diversify™ 
PCR Random Mutagenesis Kit (BD Biosciences Clontech, Palo 
Alto, CA) . Thus, a plurality of variant, i.e., mutated, 
regulator proteins of secondary metabolite production may 
be produced using established mutagenesis techniques. 

As used herein, the term "activity" refers to a 
characteristic of the regulator protein that negatively 
or positively affects the biological system to bring 
about a modulation in secondary metabolite production. By 
way of non-limiting example, the activity is the 
transcription of downstream genes involved in the 
biosynthetic pathway of the secondary metabolite of 
choice. Thus, in the present example, the phrase "more 
activity" refers to the property of a variant regulator 
protein to bring about more transcription than that 
effected by the cognate, wild- type regulator protein. 

In certain embodiments of the third aspect, the 
selected variant regulator protein has more activity in a 
fungal cell than the cognate, wild- type protein. In 
certain embodiments of the third aspect, the protein 
regulator of secondary metabolite production is a 
transcription factor. In certain embodiments of the 
fourth aspect, the protein regulator of secondary 
metabolite production is a transmembrane transporter, a 
protein that mediates secretion, a kinase, a G-protein, a 
cell surface receptor, a GTPase activating protein, a 
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guanine nucleotide exchange factor, a phosphatase, a 
protease, a phosphodiesterase, a bacterial protein toxin, 
an importin, an RNA-binding protein, an SCF complex 
component, an adherin, or a protein encoded within a 
biosynthetic cluster. . In certain other embodiments of 
the third aspect, the selected variant regulator protein 
has more activity in a heterologous cell than the 
cognate, wild- type protein. In certain embodiments 
thereof, the heterologous cell is an organism selected 
from the group consisting of S. cerevisiae, E. coli, A. 
nidulans, Candida, sp. , and N. crassa. In yet certain 
other embodiments of the third aspect, the selected 
variant regulator protein has more activity in a 
homologous cell than the cognate, wild- type protein. In 
certain embodiments thereof, the homologous cell is an 
organism selected from the group consisting of 
Aspergillus sp . , Penicillium sp . , Acremonium chrysogenum, 
Yarrowia lipolytica, Nodulisporium sp . , Fusarium sp . , 
Monascus sp . , Claviceps sp . , Trichoderma sp. , 
Tolypocladium sp . , Tricotheicium sp. , Fusidium sp. , 
Emericellopsis sp. f Cephalosporium sp. , Cochliobolus sp. t 
Helminthosporium sp . , Agaricus brunescens, Ustilago 
maydis, Neurospora sp . , Pestalotiopsis sp . , and Phaffia 
rhodozyma . 

In certain embodiments of the third aspect, the 
selected variant regulator protein has more activity in a 
heterologous cell and a homologous cell than the cognate, 
wild- type protein. In certain embodiments thereof, the 
heterologous cell is an organism selected from the group 
consisting of S. cerevisiae, E . coli, A. nidulans , 
Candida sp. , and N . crassa and the homologous cell is an 
organism selected from the group consisting of 
Aspergillus sp . , Penicillium sp . , Acremonium chrysogenum, 
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Yarrowia lipolytics, Nodulisporium sp . , Fusarium sp . , 
Monascus sp . , Claviceps sp., Trichoderma sp . , 
Tolypocladium sp . , Tricotheicium sp. r Fusidium sp,, 
Emericellopsis sp. , Cephalosporium sp. , Cochliobolus sp. , 
Helminthosporium sp. , Agaricus brunescens , Ustilago 
maydis, Neurospora sp., Pestalotiopsis sp. and Phaffia 
rhodozyma . 

As used herein, the phrase "heterologous cell 11 
refers to a system for gene expression, i.e., an organism 
for gene expression, that is one other than the organism 
from which the selected regulator protein of secondary 
metabolite production has been isolated. Preferred 
heterologous cells include, but are not limited to, S. 
cerevisiae, E. coli, A. nidulans , and Candida sp. , . and 
N. crassa. Particularly preferred are fungal 
heterologous cells. In an embodiment of the third 
aspect, the method comprises: (a) selecting a nucleic 
acid comprising a polynucleotide encoding a protein 
regulator of secondary metabolite production; (b) 
mutating the nucleic acid to create a plurality of 
nucleic acid molecules encoding variant regulator 
proteins of secondary metabolite production; and (c) 
selecting a mutagenized nucleic acid encoding a variant 
regulator protein with increased activity in a homologous 
cell than the cognate, wild- type protein. 

As used herein, the phrase "homologous cell" refers 
to a system for gene expression, i.e., an organism for 
gene expression, that is the organism from which the 
regulator protein of secondary metabolite production has 
been isolated. Preferred homologous cells are fungal 
homologous cells, including, but not limited to, 
Aspergillus sp . , Penicillium sp . , Acremonium chrysogenum, 
Yarrowia lipolytica, Nodulisporium sp . , Fusarium sp . , 
Monascus sp . , Claviceps sp . , Trichoderma sp. f 


-32- 


BOSTON 1322173vl 


Tolypocladium sp., Tricotheicium sp., Fusidlum sp., 
Emericellopsis sp . , Cephalosporium sp. , Cochliobolus sp. , 
Helminthosporium sp. , Agaricus brunescens , Ustilago 
maydis, Neurospora sp., Pestalotiopsis sp and Phaffia 
rhodozyma. (See, Fung a 1 Ph v s i o 1 oqv , Chapter 9 
(Secondary (Special) Metabolism), Griffin, D. H., John 
Wiley & Sons, Inc.; ISBN: 0471166154). 

In certain embodiments of the third aspect, the 
method further comprises selecting a variant regulator 
protein that also increases production of a secondary 
metabolite in a cell when compared to the cognate, wild- 
type protein. In certain embodiments thereof, the cell 
is a fungal cell. In certain embodiments thereof, the 
cell is a heterologous cell, preferably selected from the 
group consisting of S. cerevisiae, E. coll, A. nidulans , 
Candida sp., and N. crassa. 

In certain embodiments thereof, the cell is a 
homologous cell, preferably selected from the group 
consisting of Aspergillus sp . , Penlcilllum sp . , 
Acremonlum chrysogenum, Yarrowla llpolytlca, 
Nodulisporlum sp . , Fusarium sp., Monascus sp . , Clavlceps 
sp., Trlchoderma sp., Tolypocladium sp., Tricotheicium 
sp. , Fusidium sp., Emericellopsis sp . , Cephalosporlum 
sp. , Cochliobolus sp., Helminthosporium sp., Agarlcus 
brunescens, Ustilago maydis, Neurospora sp., 
Pestalotiopsis sp., and Phaffia rhodozyma. 

Certain embodiments of the aspects of the invention 
relate to regulator proteins that promote secondary 
metabolite production by increasing transcription of one 
or more genes involved with secondary metabolite 
production. These wild- type sequences may be selected 
for mutagenesis to create a plurality of variant 
regulator proteins. The activity of these transcription- 
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activating variant regulator proteins may be determined 
by measuring the activity of a reporter gene having the 
appropriate promoter sequences. These tests are done in 
a homologous and/or a heterologous cell. Certain 
embodiments of aspects of the invention are directed to 
fungal regulator proteins with transcription-activating 
activity that is tested in fungal heterologous and 
homologous cells. 

Reporter genes are useful for isolating 
transf ormants expressing improved variant regulator 
proteins. The reporter genes may be operably linked to a 
promoter sequence that is normally regulated by the wild- 
type regulator protein. Reporter genes include, but are 
not limited to, genes encoding p-galactosidase (lacZ) , (3- 
glucoronidase ( GUS) , p-glucosidase , amylase and 
invertase, amino acid biosynthetic genes, e.g., the yeast 
LEU2, HIS3 , LYS2, TRP1 genes (or homologous genes from 
other fungi, such as filamentous fungi, that encode 
proteins with the similar functional activities) , nucleic 
acid biosynthetic genes, e.g., the yeast URA3 and ADE2 
genes (or homologous genes from other fungi, such as 
filamentous fungi, that encode proteins with the similar 
functional activities) , the mammalian chloramphenicol 
transacetylase (CAT) gene, or any surface antigen gene 
for which specific antibodies are available. A reporter 
gene can also be a neomycin phosphotransf erase (neo) gene, 
which encodes neomycin, kanamycin resistance gene and 
G418 (geneticin) resistance gene. A reporter gene may 
encode a protein detectable by luminescence or 
fluorescence, such as green fluorescent protein (GFP) . 
Reporter genes may additionally or alternatively encode 
any protein that provides a phenotypic marker, for 
example, a protein that is necessary for cell growth or 
viability, or a toxic protein that causes cell death. 
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Alternatively, the reporter gene may encode a protein 
detectable by a color assay leading to the presence or 
absence of color . 

The choice of reporter gene will depend on the type 
of cell to be transformed. Preferred reporter genes are 
those that are operable in fungal cells . It is 
preferable to have two reporter genes within the cell. 
One reporter gene, when expressed, provides a growth 
advantage to transformed cells that are expressing the 
variant regulator protein. This allows for the isolation 
of such transf ormants though selective pressures. The 
other reporter gene provides a colorimetric marker, such 
as the la.cZ gene and its encoded protein, |3- 
galactosidase . Alternatively, the second reporter 
provides a fluorescent or luminescent marker, such as 
green fluorescent protein (GFP) . 

In a fourth aspect, the invention provides a method 
of increasing production of a secondary metabolite 
comprising: (a) selecting a nucleic acid comprising a 
polynucleotide encoding a protein regulator of secondary 
metabolite production; (b) mutating the nucleic acid to 
create a plurality of nucleic acid molecules encoding 
variant regulator proteins of secondary metabolite 
production; (c) selecting a variant regulator protein 
with more activity than the cognate, wild- type protein; 
and (d) expressing the selected variant regulator protein 
in a cell, thereby increasing production of the secondary 
metabolite in the cell. 

In certain embodiments of the fourth aspect, the 
cell is a fungal cell. In certain embodiments of the 
fourth aspect, the protein regulator of secondary 
metabolite production is a transcription factor. In 
certain embodiments of the fourth aspect, the protein 
regulator of secondary metabolite production is a 
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transmembrane transporter, a protein that mediates 
secretion, a kinase, a G-protein, a cell surface 
receptor, a GTPase activating protein, a guanine 
nucleotide exchange factor, a phosphatase, a protease, a 
phosphodiesterase, a bacterial protein toxin, an 
importin, an RNA-binding protein, an SCF complex 
component, an adherin, or a protein encoded within a 
biosynthetic cluster. In certain embodiments of the 

fourth aspect, the cell is a heterologous cell, 
preferably selected from the group consisting of S. 
cerevisiae, E. call, A. nidulans, Candida sp . , and N. 
crassa. In certain other embodiments of the fourth 
aspect, the cell is a homologous cell, preferably 
selected from the group consisting of Aspergillus sp . , 
Penlcllllum sp . , Acremonium chrysogenum, Yarrowia 
llpolytica, Nodulisporium sp . , Fusarium sp . , Monascus 
sp,, Claviceps sp., Trichoderma sp. f Tolypocladium sp . , 
Tricotheicium sp. , Fusldlum sp. , Emericellopsis sp. , 
Cephalosporium sp. , Cochllobolus sp. , Helmlnthosporium 
sp. , Agarlcus brunescens , Ustllago maydis, Neurospora 
sp., Pestalotlopsls sp . , and Phaffla rhodozyma. 

In certain other embodiments of the fourth aspect, 
the cell is a heterologous cell and the method further 
comprises expressing the variant regulator protein in a 
homologous cell, thereby increasing secondary metabolite 
production in the homologous cell. In certain 
embodiments thereof, the heterologous cell is an organism 
selected from the group consisting of S. cerevlsiae, E . 
coll, A. nldulans, Candida sp. , , and N. crassa and the 
homologous cell is an organism selected from the group 
consisting of Aspergillus sp . , Penlcllllum sp . , 
Acremonlum chrysogenum, Yarrowla llpolytica, 
Nodulisporium sp . , Fusarium sp . , Monascus sp . , Claviceps 
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sp., Trichoderma sp. , Tolypocladium sp., Tricotheicium 
sp. , Fusidium sp. , Emericellopsis sp. , Cephalosporium 
sp. , Cochliobolus sp. , Helminthosporium sp. , Agaricus 
brunescens, Ustllago rnaydis, Neurospora sp . , 
Pestalotiopsis sp.and Phaffia rhodozyma. 

In a fifth aspect, the invention provides an 
isolated variant regulator protein of secondary 
metabolite production having increased activity compared 
to a cognate, wild- type protein, made by the process 
comprising: (a) selecting a nucleic acid comprising a 
polynucleotide encoding a protein regulator of secondary 
metabolite production; (b) mutating the nucleic acid to 
create a plurality of nucleic acid molecules encoding 
variant regulator proteins of secondary metabolite 
production; (c) selecting a variant regulator protein 
with more activity than the cognate, wild- type protein; 
and (d) recovering the selected variant regulator 
protein . 

In certain embodiments of the fifth aspect, the 
variant regulator protein selected has more activity in a 
fungal cell. In certain embodiments of the fifth aspect, 
the protein regulator of secondary metabolite production 
is a transcription factor. In certain embodiments of the 
fifth aspect, the protein regulator of secondary 
metabolite production is a transmembrane transporter, a 
protein that mediates secretion, a kinase, a G-protein, a 
cell surface receptor, a GTPase activating protein, a 
guanine nucleotide exchange factor, a phosphatase, a 
protease, a phosphodiesterase, a bacterial protein toxin, 
an importin, an RNA-binding protein, an SCF complex 
component, an adherin, or a protein encoded within a 
biosynthetic cluster. In certain embodiments of the 

fifth aspect, the variant regulator protein selected has 
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more activity in a heterologous cell, preferably selected 
from the group consisting of S. cerevislae , E. coll, A. 
nidulans , Candida sp. , Neurospora sp . , Pestalotlopsls 
sp. , and N. crassa. In certain embodiments of the fifth 
aspect, the variant regulator protein selected has more 
activity in a homologous cell, preferably selected from 
the group consisting of Aspergillus sp . , Penlcllllum sp., 
Acremonlum chrysogenum, Yarrowla lipolytica, 
Nodulisporlum sp., Fusarlum sp . , Monascus sp . , Claviceps 
sp., Trlchoderma sp., Tolypocladlum sp., Tricotheicium 
sp. f Fusldlum sp. , Emericellopsis sp . , Cephalosporium 
sp. , Cochllobolus sp. , Helmlnthosporium sp . , Agarlcus 
brunescens, Ustilago maydis, Neurospora sp., 
Pestalotlopsls sp., and Phaffla rhodozyma. 

In certain embodiments of the fifth aspect, the 
variant regulator protein selected has more activity in a 
homologous cell and a heterologous cell. In embodiments 
thereof, the heterologous cell is an organism selected 
from the group consisting of S. cerevlsiae, E. coll, A. 
nldulans, Candida sp. f Neurospora sp . , Pestalotlopsls 
sp., and N. crassa and the homologous cell is an organism 
selected from the group consisting of Aspergillus sp . , 
Penlclllium sp . , Acremonlum chrysogenum, Yarrowla 
lipolytica, Nodulisporlum sp . , Fusarlum sp., Monascus 
sp., Clavlceps sp . , Trlchoderma sp. t Tolypocladlum sp . , 
Tricothelcium sp. , Fusldlum sp. , Emericellopsls sp., 
Cephalosporium sp. , CochlioJboIus sp. , Helminthosporium 
sp. , Agaricus brunescens, Ustilago maydis , Neurospora 
sp., Pestalotlopsls sp. f and Phaffla rhodozyma. 

In yet another embodiment of the fifth aspect, the 
variant regulator protein is a variant protein of the 
lovE protein having at least one of the following 
mutations: (1) a Group 6 amino acid residue mutated to a 
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Group 2 amino acid residue at position 31, for example, 
the mutation represented by F31L; (2) a Group 3 amino acid 
residue mutated to a Group 5 amino acid residue at 
position 41, for example, the mutation represented by 
Q41K or Q41R; (3) a Group 4 amino acid residue mutated to 
a Group 2 amino acid residue at position 52, for example, 
the mutation represented by T52I; (4) a Group 4 amino 
acid residue mutated to a Group 3 amino acid residue at 
position 52, for example, the mutation represented by 
T52N; (5) a Group 4 amino acid residue mutated to a Group 
5 amino acid residue at position 73, for example, the 
mutation represented by C7 3R; (6) a Group 1 amino acid 
residue mutated to a Group 4 amino acid residue at 
position 101, for example, the mutation represented by 
P101S; (7) a Group 1 amino acid residue mutated to a 
Group 3 amino acid residue at position 101, for example, 
the mutation represented by P101Q; (8) a valine amino 
acid residue mutated to another Group 2 amino acid 
residue at position 111, for example, the mutation 
represented by Villi; (9) a Group 4 amino acid residue 
mutated to a Group 2 amino acid residue at position 133, 
for example, the mutation represented by S133L; (10) a 
Group 3 amino acid residue mutated to a Group 2 amino 
acid residue at position 141, for example, the mutation 
represented by E141V; (11) a Group 3 amino acid residue 
mutated to a Group 5 amino acid residue at position 141, 
for example, the mutation represented by E141K; (12) a 
Group 4 amino acid residue mutated to Group 6 amino acid 
residue at position 153, for example, the mutation 
represented by C153Y; (13) a Group 4 amino acid residue 
mutated to a Group 5 amino acid residue at position 153, 
for example, the mutation represented by C153R; (14) a 
Group 4 amino acid residue mutated to a Group 1 amino 
acid residue at position 281, for example, the mutation 
represented by T281A; (15) a Group 3 amino acid residue 
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mutated to a Group 2 amino acid residue at position 3 67, 
for example, the mutation represented by N3 6 7I; (16) a 
Group 3 amino acid residue mutated to a Group 6 amino 
acid residue at position 3 67, for example, the mutation 
represented by N3 6 7Y; (17) a Group 1 amino acid residue 
mutated to Group 4 amino acid residue at position 389, 
for example, the mutation represented by P3 8 9S; and/or 
(18) a Group 1 amino acid residue mutated to a Group 2 
amino acid residue at position 3 89, for example, the 
mutation represented by P3 89L. 

In certain embodiments of this aspect of the 
invention, the variant protein of the lovE protein 
sequence has an amino acid sequence of SEQ ID NO: 41, SEQ 
ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ 
ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO : 4 9 , SEQ 
ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ 
ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ 
ID NO:58, SEQ ID NO:59, SEQ ID NO: 60, SEQ ID NO:61, SEQ 
ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, or SEQ ID NO: 65. 

In another embodiment thereof, the variant protein 
of the lovE protein is encoded by a nucleic acid molecule 
having a polynucleotide sequence of SEQ ID NO: 66, SEQ ID 
NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID 
NO: 71, SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID 
NO: 75, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 79, SEQ ID 
NO: 81, SEQ ID NO: 82, SEQ ID NO: 83, SEQ ID NO: 84, SEQ ID 
NO: 86, SEQ ID NO: 87, SEQ ID NO: 88, SEQ ID NO: 89, or SEQ 
ID NO: 90. 

In a sixth aspect, the invention provides a fungus 
having improved lovastatin production made by the process 
of transforming a fungal cell with a nucleic acid 
molecule encoding a variant of the lovE protein of the 
first aspect of the invention. In an embodiment thereof, 
the nucleic acid molecule is selected from a nucleic acid 
molecule of the second aspect of the invention. 
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In a seventh aspect, the invention provides an 
improved process for making lovastatin comprising 
transforming a fungal cell with a nucleic acid molecule 
encoding a variant of the lovE protein of the first 
aspect of the invention. In an embodiment thereof, the 
fungal cell is transformed with a nucleic acid molecule 
of the second aspect of the invention. 

International Patent Application PCT/US99/29583 
discloses lovastatin production genes. However, this 
reference does not provide a mature lovE cDNA sequence. 
The invention herein remedies the shortcoming of this 
reference by providing a complete cDNA sequence for the 
lovE mRNA . 

In an eighth aspect, the invention provides a 
nucleic acid molecule encoding a lovE protein defined by 
SEQ ID NO: 91. In an embodiment thereof, the invention 
provides an isolated lovE nucleic acid molecule defined 
by SEQ ID NO: 92 . 

The following examples illustrate the preferred 
modes of making and practicing the present invention but 
are not meant to limit the scope of the invention since 
alternative methods may be utilized to obtain similar 
results . 

EXAMPLES 

Example 1: Preparation of Strains and Plasmids 

Strain MY2124 was derived from the Sigma 1278b 
strain background of S. cerevisiae and its complete 
genotype is as follows: MATa/MATOC: : LEU2 ura3A0 /ura3A0 
leu2A0/leu2A0 trplAO : :hisG/trplAO : :hisG 

his3A0: :h±sG/h±s3A0 : ihisG ura3A0 : : lovF-HIS3p-neo/ura3A0 . 
MY2124 can be constructed by mating S. cerevlsiae strains 
MY2112 {MATa ura3A0 leu2A0 trplAO : :hisG his3A0 : ihisG 
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5 ura3A0 : : lovFp-HIS3p-neo) with MY1555 {mat(X::LEU2 ura.3A0 
leu2A0 trplAO::hisG h±s3A0 : : hisG) and isolating zygotes. 
The ura3A0 : : lovFp -HIS3p-neo allele of MY2112 was derived 
by cotransf orming Sfil - linearized plasmid MB2254 with 
pRS424 (Sikorski and Hieter (1989) Genetics 122:19-27) 

10 into MY1413 (MATa leu2A0 trplAO : : hisG his3A0 : : hisG) . 
Trans formants were selected on SC-Trp media and 
subsequently screened for 5-f luoro-orotic acid resistance 
to identify those transf ormants containing the 
UT3L3A0: : lovFp-HIS3p-neo allele. Trp" segregants lacking 

15 plasmid pRS424 were isolated by growing the strain under 
non-selective conditions . 

The following oligonucleotides were used in the 
construction of plasmids. 


Table 

2: Olicronucleotides Utilized For LovE Variant Cloning 

M06 64 ( 5 ' GGCCATGGAGGCCGCTAGCTCGAGTCGACGGCCTAGGTGGCCAGCT3 ' ) 
(SEQ ID NO:l) 

M06 65 ( 5 ' GGCCACCTAGGCCGTCGACTCGAGCTAGCGGCCTCCATGGCCGTAC3 ' ) 
(SEO ID NO: 2) 

MO666 

( 5 ' GGCGGCCGCTCTAGAACTAGTCTCGAGGGTACC3 ' ) (SEQ ID NO : 3 ) 

M0667 

( 5 ' GGTACCCTCGAGACTAGTTCTAGAGCGGCCGCC 3 ' ) ( SEQ ID NO : 4 ) 

M017 94 

( 5 ' CACAGCGGCCGCTCAACCTTCCCATTGGGGC3 ' ) ( SEQ ID NO : 5 ) 

M017 93 

( 5 ' CACCACTAGTACGCGGGCTGATTCGAC3 ' ) (SEQ ID NO : 6 ) 

M017 85 

( 5 ' CACCACTAGTTATACATTATATAAAGTAATGTG3 ' ) ( SEQ ID NO : 7 ) 

M017 86 

( 5 ' CACAGGATCCGTCATCTTTGCCTTCGTTTATC3 ' ) ( SEQ ID NO : 8 ) 

M0195 

( 5 ' CGCGGATCCTATTGAACAAGATGGATTGCAC3 ' ) ( SEQ ID NO : 9 ) 

M0196 

( 5 ' C CGGAATT C AG AAG AAC T CGT C AAGAAG 3 ' ) ( SEQ ID NO : 1 0 ) 

M0841 

ID NO: 

( 5 ' ACAAAAAAGCAGGCTCCACAATGGCTGCAGATCAAGGTAT3 1 ) ( SEQ 
11) 

M0842 

NO: 12) 

(5 1 ACAAGAAAGCTGGGTTCATGGAGGAATATTGTTGA3 ' ) (SEQ ID 

M02278 

( 5 ' GGGGAT C C AAT CG AGGT C C ACGAC C AGT 3 1 ) (SEQ ID NO : 13 ) 

M0343 

( 5 ' GGGGACAAGTTTGTACAAAAAAGCAGGCT 3 ' ) (SEQ ID NO : 14 ) 

M02273 

( 5 ' GGGGAT CCGCCAATGGTCCCGTTCAAAC 3 1 ) ( SEQ ID NO : 15 ) 

M02274 

NO: 16) 

(5 1 ACAAGAAAGCTGGGTTCACAGAATGTTTAGCTCAA3 1 ) (SEQ ID 

M0344 

( 5 1 GGGGAC C ACTTTGTAC AAGAAAGCTGGGT 3 1 ) ( SEQ ID NO : 17 ) 

M02624 

NO: 18) 

(5 1 GCGATGC C C C AAGCGC AAGCTACGC C AAT CC AGGG 3 ' ) (SEQ ID 

M02 654 

NO: 19) 

(5 1 CGTCGCGCCATTCGCCATTCAGGCTGCGCAACTGT3 1 ) (SEQ ID 
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MQ2 680 ( 5 1 GGACCTTTGCAGCATAAATTACTATACTTCT3 1 ) ( SEQ ID NO : 2 0 ) 
M02 6 86 ( 5 1 GGCGCGTCCATTCGCCATTCAGGCTGCGCAACTGT3 1 ) ( SEQ ID 

NO : 2 1 ) 

MQ2 681 ( 5 1 TAAAACTCTTGTTTTCTTCTTTTCTCTAAAT3 1 ) (SEQ ID NO : 22 ) 
M02 7 00 ( 5 1 CAGTGAGCGCGCGTAATACGACTCACTATAGGGCGA3 ' ) ( SEQ ID 

NO: 23) 

MO27 01 (5 1 ATACTTCTATAGACACACAAACACAAATACACACAC3 1 ) (SEQ ID 

NO: 24) . 

MO107 ( 5 1 CGCGGATC CCGTCGTTTTAC AAC 3 1 ) (SEQ ID NO : 2 5 ) 

MQ197 ( 5 1 CCCAAGCTTATTATTTTTGACACCAGACCAA3 1 ) ( SEQ ID NO : 2 6 ) 
M012 93 ( 5 1 GGAAGATCTAGCATCGTGGCCAATTTCTTCTAGTTT3 ' ) ( SEQ ID 

NO:27) 

M012 94 ( 5 ' ATAAGAATGCGGCCGCTCAACCTTCCCATTGGGGCGTTTGC3 ' ) ( SEQ 

ID NO: 28) 

M017 87 (5 1 CACAGGATCCAGCATTATTAATTTAGTGTGTGTATTT3 1 ) (SEQ ID 

NO : 2 9 ) 

MQ17 8 8 ( 5 1 CACCACTAGTCTCGAGCAGATCCGCCAG3 ' ) ( SEQ ID NO : 3 0 ) 
MQ17 93 ( 5 1 CACCACTAGTACGCGGGCTGATTCGAC3 1 ) ( SEQ ID NO : 3 1 ) 
MQ1794 (5 1 CACAGCGGCCGCTCAACCTTCCCATTGGGGC3 ' ) (SEQ ID NO:32) 
M0511 (5 1 GGCCATCGATACAAGTTTGTACAAAAAAGCTGAAC3 1 ) (SEQ ID 

NO: 33) . 

MQ54 0 ( 5 1 GGCGCCCTATTACACCACTTTGTACAAGAAAGC3 1 ) ( SEQ ID NO : 3 4 ) 
M01985 ( 5 1 CACACGTCTCCGGCCTCAACCTTCCCATTGGGGCG3 ' ) (SEQ ID 

NO : 3 5 ) 

M01986 (5 1 CACACAGATCTCGTGGCCAATTTCTTCTAGTTTGA3 1 ) (SEQ ID 

NO : 3 6 ) \ 

M01992 (5 1 CACACGGATCCACAATGTTACGTCCTGTAGAAACCCC3 1 ) (SEQ ID 

NO: 37) 

M01993 (5 1 CACAGCGGCCGCTTCATTGTTTGCCTCCCTGCTG3 1 ) (SEQ ID 

NO : 3 8 ) t 

M03 16 ( 5 1 GCGGCCGCGGCGCCCGGCCCATGTCAACAAGAAT3 ' ) ( SEQ ID 

NO : 3 9 ) J 

MQ3 18 ( 5 1 CCGCGGCCGAGTGGAGATGTGGAGT3 1 ) ( SEQ ID NO : 4 0 ) 


Plasmid MB2254 contains the lovFp-HIS3p- neo reporter 
gene flanked by URA3 sequence. First primers M0664 (SEQ 
ID NO:l) and M0665 (SEQ ID NO : 2 ) were annealed and 

10 inserted into the Kpnl-Sacl sites of plasmid pBluescript 
II KS (Stratagene, ) . The resulting vector, MB1038, 
contains a Sail site in the polylinker. Next, the Spel- 
Xhol fragment from pJL164 (Brachmann et al . Yeast 14:115- 
132 (1998)) containing a deletion of the URA3 gene with 

15 additional flanking sequences was inserted into the Nhel- 
Sall sites of MB1038 to create MB1053 . Primers MO666 
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(SEQ ID NO: 3) and M0667 (SEQ ID NO : 4 ) that contain 
multiple restriction sites (NotI , Xbal, Spel r Xhol and 
Kpnl) were then annealed together and ligated into the 
Smal site of MB1053 to create MB1054. Next, the 
following four fragments were combined in MB1054 to 
obtain plasmid MB2254. The lovF promoter from A. terreus 
genomic DNA was PCR amplified with M01794 (SEQ ID NO: 5) 
and M01793 (SEQ ID NO : 6 ) and inserted into MB1054 on a 
Notl-Spel fragment. The HIS3 basal promoter from pRS403 
(Sikorski and Hieter, Genetics 122:19-27 (1989)) was PCR 
amplified with primers M01785 (SEQ ID NO: 7) and M01786 
(SEQ ID NO: 8) and inserted into MB1054 on a Spel-BamHI 
fragment. Finally, the neo gene (PCR amplified with 
M0195 (BamHI) (SEQ ID NO:) and M0196 (EcoRT) (SEQ ID 
NO: 10) from plasmid pYXll (Xiao and Weaver, Nucl . Acids 
Res. 25:2985-2991 (19 97 J) and CYC1 terminator sequences 
(Xhol-Kpnl fragment from pRS426-GAi-S (Mumberg, et al . , 
Nucl. Acids. Res. 22:5767-5768 (1994)) were first 
combined in pRS416 (Sikorski and Hieter, Genetics 122:19- 
27 (1989)) and then cut out with BamHI-Xpnl and inserted 
into MB1054 to create MB2254 . 

The lovFp-HIS3p-neo reporter in MY2124 can confer 
resistance to the drug geneticin (G418) . It was 
empirically determined that MY2124 (untransf ormed or 
transformed with parental plasmids MB2478 (CYC1- lovE/ CEN) 
or MB2848 (CYC1- lovE/ At2 74/ CEN) was unable to grow on YPD 
media supplemented with 100 |lg /ml G418. Plasmid MB2478 
contains the CYC1 promoter operationally linked to the 
entire A . terreus lovE open reading frame. The CYC1 
promoter is a relatively weak promoter and thus the lovE 
ORF in MB2478 was expressed at low levels. MB2478 was 
the parental vector plasmid for creating full length lovE 
variants. Plasmid MB2848 contains the CYC1 promoter 
operationally linked to a chimeric open reading frame 
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5 consisting of the A, terreus lovE DNA binding domain 

fused to the carboxy- terminal portion of the At274 gene 
(U.S. Serial No. 60/257,431, filed December 22, 2000). 

MB2 84 8 was used to create lovE variants in which the 
DNA binding domain was not mutated. Both MB24 7 8 and 

10 MB2 84 8 contain yeast CEN and autonomously replicating 

sequences and both are maintained at 1-2 copies per cell. 
In contrast to strains transformed with MB2478 or MB2 848, 
strains transformed with plasmid MB1644 (TEFl-lovE/2 
micron) were able to grow on G4 18 -supplemented YPD media. 

15 The lovE gene of MB1644 is under control of the 

const itutively strong S. cerevisiae TEF1 promoter. 
MB1644 contains a 2 -micron origin for high-copy 
replication in yeast. An objective of these studies was 
to identify lovE variants which when expressed at low 

20 levels could confer G418 resistance similar to the highly 
expressed wild-type lovE molecule of MB1644. S. 
cerevisiae expression vectors used in these studies were 
constructed as follows. 

MB96 8 is a low copy S. cerevlslae URA3 based 

25 expression vector. MB968 was created by inserting the 

EcoRV fragment (containing the destination cassette) from 
gateway pEZC72 01 ( Invitrogen™ , Carlsbad, CA) into 
Xhol/ Sail (filled in with Klenow) linearized pRS416 CYC1 
(Mumberg, et al . , Gene 156:119-122 (1995)). 

30 MB1644 and MB247 8 are URA3 -based S. cerevisiae 

expression plasmids that contain the wild- type lovE gene. 
They are both derivatives of MB1199. MB1199 was created 
by using primers M0841 (SEQ ID NO: 11) and M0842 (SEQ ID 
NO: 12) to amplify the lovE ORF from A . terreus cDNA . 

35 Gateway (Invitrogen™, Carlsbad, CA) Cloning Technology 
(US Patent 5,888,732) was used to clone the lovE PCR 
fragment into the gateway entry vector pDONR206 
(Invitrogen™, Carlsbad, CA) to create MB1199. Similarly, 
Gateway Cloning Technology was used to transfer the lovE 
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ORF from MB1199 into MB968 to create MB2478 and into 
MB969 (U.S. Serial No. 60/198,335, filed April 18, 2000) 
to create MB1644 . 

MB2848 is a derivative of MB968 that contains a 
lovE-AT2 74 chimera. The lovE portion of MB2 84 8 was 
derived by using oligos M0841 (SEQ ID NO: 11) and M02278 
(SEQ ID NO: 13) to PCR amplify the lovE DNA binding domain 
from A. terreus cDNA. A second round of PCR was 
performed with primers M0343 (SEQ ID NO: 14) and M02278 to 
add appropriate Gateway Cloning Technology compatible 
sequences. The At274 portion of MB2848 can be derived by 
using primers M02273 (SEQ ID NO: 15) and M02274 (SEQ ID 
NO: 16) to PCR amplify the carboxy- terminal domain of 
At2 74 from A . terreus cDNA . A second round of PCR was 
performed with primers M0344 (SEQ ID NO: 17) and M02273 to 
add appropriate Gateway Cloning Technology compatible 
sequences. The lovE and At274 PCR products were cut with 
BamHI and purified over a QIAquick PCR purification kit 
(Qiagen, Valencia, CA) according to manufacturer's 
instructions. Finally, the products were mixed 3-4 hours 
in a standard ligation reaction and used in Gateway entry 
and destination reactions to create MB2848. 

Gateway cloning technology was used to clone the 
lovE variants of interest into plasmid MB1419 which is a 
filamentous fungal expression vector. The MB1419 fungal 
selection marker is the A. nidulans GPD promoter 
controlling the jble gene from S. hindustanus . The 
transgene is controlled by the A . nldulans PGK promoter. 
A. terreus strain MF117 is a derivative of A . terreus 
strain ATCC 20542. 

Example 2: PCR Mutagenesis of the lovE DNA Binding Domain 

The zinc finger DNA binding domain of lovE is encoded 
by nucleotides 100-201 (SEQ ID NO:92). Oligos M02624 


-46- 


BOSTON 1322173vl 


( SEQ ID NO: 18) and M02654 (SEQ ID NO: 19) were used to PCR 
amplify a lovE containing fragment from plasmid MB2478. 
The 1.7 kb product contains nucleotides 212-1410 of lovE 
and -500 bp of flanking vector sequence. Two rounds of 
standard PCR (1.5 mM MgCl 2 ) were performed with Amplitaq 
DNA polymerase (Applied Biosystems, Foster City, Ca) 
according to the manufacturer's instructions. 

Plasmid MB2848 was cut with Kpnl-BamHI to release a 1.1 
kb fragment containing the At2 74 portion of the lovE- 
At2 74 chimeric open reading frame. The remaining 5.5 kb 
vector sequence retains the lovE DNA binding domain. 

Example 3: PCR Mutagenesis of the lovE Open Reading Frame 

lovE open reading frame insert was prepared 
according to the following procedure. Oligo pairs MO2680 
(SEQ ID NO:20) /M02686 (SEQ ID N0:21), M02681 (SEQ ID 
NO:22) /M02686, and MO2700 (SEQ ID NO:23) /MO2701 (SEQ ID 
NO: 24) were used to PCR amplify the entire lovE open 
reading frame from plasmid MB2478. The PCR products 
differ in the amount of 5 1 and 3' vector sequence 
flanking the lovE open readiing frame. 

PCR was performed using a GeneMorph PCR mutagenesis 
kit (Stratagene, La Jolla, Ca) according to 
manufacturer's instructions to achieve medium and high 
range mutation frequencies. 

Plasmid MB2478 was cut with Asp718/Xbal to release a 
1.7 kb fragment. The remaining 5.0 kb vector sequence 
completely lacks lovE ORF sequence. 

Example 4: Transformation and Selection for G418R 
Isolates 

All PCR products were purified using a QIAquick PCR 
purification kit (Qiagen) according to manufacturer's 
instructions. All vectors were gel purified using a 
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QIAquick gel extraction kit (Qiagen) according to 
manufacturer's instructions. 

The mutagenesis strategy of Muhlrad et al . (Yeast 
8:79-82 (1992) ) was used which involves cotransf orming a 
mutated PCR product and gapped plasmids into S. 
cerevisiae , and then screening for in vivo recombinants 
having the desired phenotype) . 

Transformation of Saccharomyces cerevisiae was 
accomplished by the lithium acetate/single-stranded 
carrier DNA/polyethylene glycol (LiAc/ss -DNA/PEG) 
protocol (Woods R . A . and Gietz R.D. Methods Mol . Biol. 
177:85-97 (2001)) with a 1:5 molar ratio of vector : insert 
DNA to generate >55,000 in vivo recombinant transf ormants 
on SC-Ura plates. Transf ormants were transferred by 
replica printing to YPD plates containing 100 |ig/ml G418 
and allowed to grow for 2-4 days at 30°C (Figure 1) . 

Drug resistant clones were confirmed in secondary 
assays including growth on G418 concentrations up to 2 000 
jug /ml . The plasmid-dependence of the phenotype was 
determined by observing the re -appearance of drug 
sensitivity correlating with loss of the library plasmid. 
lovE variant plasmids were recovered from promising 
candidates (Hoffman and Winston (1986) Gene 57:267). 
More than 70 lovE variants were identified and 
definitively characterized by DNA sequence and/or 
restriction digestion analysis. 

Table 3 summarizes the G418 resistance phenotype and 
sequence analysis of 26 of these variants. 
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Table 4 summarizes amino acid substitutions that 
were isolated multiple times, suggesting that they are 
particularly important for improving lovE variant 
activity on 1 ovFp - HIS3p -neo expression. 

10 

Table 4: lovE Mutations Isolated Multiple Times 


Amino Acid Change 

Number of Times 

lovE variant 

Isolated in lovE 1-41 


F31L 

4 

20, 21 , 31 , 34 

Q41K 

2* 

10, 16 

Q41R 

3* 

33, 38, 40 

T52I/T52N 

1 each 

34, 39 

C73R 

6* 

3, 4, 5, 7, 8, 9 

P101S/P101Q 

1 each 

31, 34 

V111I 

2 

34, 39 

S133L 

2 

2, 41 

E141V, E141K 

1 each 

8, 40 

C153Y/C153R 

1 each 

6, 31 

T281A 

2 

6, 39 

N367I/N367Y 

2/1 

21, 40, 37 

P389S/P389L 

1 each 

32, 38 


* allele was isolated in additional lovE variants that were not fully sequenced 


Example 5: Increased lovF-lacZ Expression in S . 
cerevisiae 

15 in order to quantify the increase in lovF 

expression, p-galactosidase activity was measured in lovE 
variant transformed S. cerevlsiae strains that also 
harbored lovFp-lacZ reporter derivative plasmids . lovF- 
la.cZ reporter derivative plasmids were constructed as 

20 follows. 

Plasmid MB1918 contains the lovFp-la.cZ reporter 
gene. It can be derived from pRS424 (Sikorski and Hieter 
(1989) Genetics 122:19-27). First, primers MO107 (SEQ 
ID NO: 25) and M0197 (SEQ ID NO: 26) are used to PCR 

25 amplify the lacZ gene from Yep355 (Myers, et al . , Gene 
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5 45:299-310 (1986)). This lacZ - containing fragment was 

inserted into the BamHI -ffindlll sites of pRS416 (Sikorski 
and Hieter, Genetics 122:19-27 (1989)). This same lacZ 
fragment can be cut out of the resulting vector with 
Kpnl-Notl and inserted into the same sites of pRS424 to 

10 create pRS424-lacZ. Primers M01293 (SEQ ID NO:27) and 

M012 94 (SEQ ID NO: 28) are used to PCR amplify a 2.0 9 kb 
fragment of the lovF promoter from A. terreus genomic 
DNA. The lovF promoter fragment was then cut with Notl- 
Bglll and inserted into Notl-BamHl linearized pRS424- 

15 lacZ. 

Plasmid MB2114 contains the lovFp- CYClp- lacZ 
reporter gene. It can be derived from pRS424-lacZ (see 
MB1918 plasmid construction) . Primers M01787 (SEQ ID 
NO:29) and M01788 (SEQ ID NO : 3 0 ) are used to amplify the 

20 264 bp basal CYC1 element from pRS415 CYC1 (Mumberg, et 
al . , Gene 156:119-122 (1995)). This 264 bp fragment was 
inserted upstream of the pRS424-lacZ derivative which has 
been digested with Spel-BamHI. Finally, the lovF 
promoter from MB1918 was PCR amplified with M01793 (SEQ 

25 ID NO: 31) and M01794 (SEQ ID NO: 32) and inserted into the 
Notl-Spel sites to create MB2114. 

Yeast strains utilized in this study include strains 
MY2145 and MY2159, which are both derived from the S. 
cerevisiae sigma 1278b strain background; the genotypes 

30 are both strains are as follows: MATa ura3A0 leu2A0 

his3A: :hisG trplAO : : hisG . MY2145 and MY2159 contain the 
lovFp-lacZ reporter plasmids MB2114 and MB1918, 
respectively. 

MY2124 transformed with individual lovE variant 

35 plasmids was mated to S. cerevisiae strains MY2154 and 
MY2159. Diploids were selected on SC-UraTrp media. 
Multiple diploids from each individual mating were 
assayed for lovFp-lacZ expression using 96 well format [3- 
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galactosidase assays. For p-galactosidase assays, cells 
were transferred from transformation plates to 96 -well 
microtiter plates containing 200 Jll Z buffer. 12 strains 
were transferred simultaneously using a 12 -channel multi- 
pipettor to scoop cells from transformation plates. 
Duplicate samples were prepared for all assays. OD 60 o 
readings were taken on samples in Z buffer. These values 
were used to normalize for equal cell number in all 
assays. After determining OD 600 , 150 |Xl of each sample in 
Z buffer was transferred onto a Millipore Multiscreen 
Assay System (Nitrocellulose Immobilon NC) , filtered, and 
then washed by filtering 200 JL1X Z buffer. 100 |Xl Z 
buffer with pME and detergents was then added to each 
well, as was 20 |ll 4 mg/ml ONPG. Reactions were 
incubated at 30 °C, stopped with 50(Xl 1 M Na 2 C0 3/ filtered 
into a polystyrene 96 -well assay plate, and OD 420 was 
determined for each assay well. p-galactosidase units 
were determined using the Miller formula (O.D. 420 X 
1000)/ (OD600*minutes*volume in mL) . Z buffer is made by 
dissolving the following in 1 L of water (16.1 g Na 2 HP0 4 - 
7H 2 0, 5.5g NaH 2 P04-H20, 0 . 75 g KC1 and 0.246 g MgSO 4 -7H 2 0) . 
Z buffer with detergents and pME is made as follows: 9.8 
ml Z buffer, 100 ^il 20 mg/ml CTAB, 100 [ll 10 mg/ml sodium 
deoxycholate, and 6 9 |Lll pME Control plasmids utilized in 
these studies included MB968, MB2478 and MB1644 . 

Results of these studies are presented in Figures 2- 
5 , demonstrating increased transcription-activating 
properties of the lovE variants disclosed herein. 

Example 6: Secondary Metabolite Production 
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5 Transformation of filamentous fungi was performed 

according to the following procedure. Protoplasts were 
generated by inoculating rich media with spores. Spores 
were allowed to germinate for about 20 hrs or until germ 
tubes were between 5 and 10 spore lengths. The germlings 

10 were centrifuged and washed twice with sterile distilled 
water and once with 1 M magnesium sulfate. Germlings 
were then resuspended in 1M magnesium sulfate containing 
approximately 2 mg/ml of Novozyme . Tubes were then 
incubated at 30°C shaking at 80 RPM for about 2 hrs or 

15 until most of the hyphae were digested and protoplasts 
were abundant. Protoplasts were filtered through one 
layer of Miracloth. At least one volume of STC was added 
and protoplasts were centrifuged. Protoplasts were 
washed twice with STC. Protoplasts then were resuspended 

20 in 1ml STC and counted in a hemacytometer. A final 

concentration of approximately 5 x 10 7 protoplasts/ml were 
frozen in a 9:1:0.1 solution of STC, SPTC and DMSO in a 
Nalgene Cryo cooler at -80°C (cools -l°C/min) . 

Solutions for transformation were as follows: STC 

25 (0.8 M Sorbitol, 25 mM Tris-HCl pH 7 . 5 , 25 mM CaCl 2 ) and 
SPTC (0.8 M Sorbitol, 40% PEG 4000, 25 mM Tris-HCl pH 8, 
50 mM CaCl 2 ) . Transformation was accomplished according 
to the following protocol. 1-5 |Xg of DNA comprising a 
lovE variant according to the invention in a fungal 

30 expression vector was placed in a 50 ml Falcon tube. 100 
|ill of previously frozen protoplasts were added to the 
DNA, gently mixed, and then incubated on ice for 3 0 min. 
15 (Xl of SPTC was added, followed by mixing by tapping 
and incubation at RT for 15 min. 500 |Xl SPTC was added 

35 and mixed well by tapping and rolling, then incubated at 
RT for 15 min. 2 5 mis of regeneration minimal medium was 
added, mixed well and poured on plates containing 25 mis 
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5 of regeneration minimal medium with 2X the concentration 
of selection drug. 

Transformation plates were incubated at 2 6°C for 5-6 
days or until colonies started to appear. Regeneration 
minimal medium contains trace elements, salts, 25 mM 

10 sodium nitrate, 0.8 M Sucrose, and 1% agarose at pH 6.5. 
The selection drug that was used successfully with A. 
terreus is phleomycin, a broad- spectrum glycopeptide 
antibiotic. Transf ormants were picked onto new plates 
with a toothpick (if the fungus was sporulating) or with 

15 sterile forceps (if the fungus did not sporulate) . 

Purification plates contained minimal medium (same as 
regeneration minimal medium but containing 2 % instead of 
0.8 M sucrose) and IX drug concentration. Picked 
transf ormants were incubated at 26°C for 5-6 days. 

20 Transf ormants were grown in production media for 

secondary metabolite production. Briefly, for A . terreus 
and lovastatin production, spores were used as the 
inoculum. Spores were obtained from the purification 
plate by using a wooden inoculation stick. The medium 

25 was RPM containing corn steep liquor, sodium nitrate, 

potassium phosphate, magnesium sulfate, sodium chloride, 
P2000 (Dow chemical) , trace elements and lactose or ^ 
glucose as carbon source. The medium was pH 6.5. Flasks 
were incubated at 26°C with shaking at 225 RPM. For 

30 static 96 -well cultures, the same medium was used and the 
spores were obtained from the purification plate with a 
wooden toothpick. 96 -well plates were incubated, without 
shaking at 2 6°C. 

Sampling was done after after 5 days for 

35 lovastatin . For shake flask experiments 1-1.5 mis of 

supernatant was placed into 96 -well plates, which were 
centrifuged and supernatants transferred to new 96 -well 
plates. Samples were frozen at "80°C for storage or for 
later assays. 
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Cultures that were grown standing in a 96 -well plate 
were centrifuged and the supernatant was transferred to a 
new 96 well plate. Samples were frozen at ~80°C. 

Example 7: Measurement of Secondary Metabolite Production 

The concentration of the secondary metabolite 
lovastatin was determined by enzyme inhibition assay 
(Figure 6) . Briefly, 10 (XL of sample was removed and 
diluted 1:100 in H 2 0 . 10 |ll of this diluted broth was 
assayed in a reaction (200 |1L total) containing 1 mM 
HMGCoA, 1 mM NADPH, 0.005 mM DTT and 5 |ll (His ) 6 HMGR . The 
disappearance of absorbance at 34 0 nm was observed over 
time. This represents the disappearance of NADPH, and 
lovastatin inhibits this reaction. 

The initial velocities were calculated for the 
reactions containing samples, adjusted for dilution, and 
compared to reactions containing lovastatin standards to 
determine levels of metabolite produced. (His ) 6 HMGR was 
expressed in Sa.ccha.romyces cerevisiae and purified with a 
nickel column. 

The results from ten individual transf ormant s for 
each allele are shown in standard box plot format in 
Figure 6. Lovastatin concentration from the 
corresponding wild-type lovE control is shown in matching 
fill pattern. For example, lovE alleles 2, 7, 8 and 9 
were all transformed and assayed at the same time as the 
non-hatched wild-type control. The horizontal line in 
each individual box represents the median. 

Lovastatin concentration was also determined by high 
pressure liquid chromatography (HPLC) . Briefly, 100 |1L 
of broth sample was removed and diluted 1:10 into 70% H 2 0- 
30% acetonitrile (900 |ll) . This mixture was spun down to 
pellet debris at 13000 RPM for 5 minutes. 900 (al of this 
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5 diluted broth was transferred to a vial and the sample 
was analyzed by HPLC . 10 |Xl were injected into a Waters 
HPLC system (996 photo-diode array detector, 600 E pump 
controller and 717 autosampler) equipped with a YMC-Pack 
ODS column (Aq-302-3, 150 x 4 . 6 mm ID, S-3 (XM pore size) 

10 and eluted with isocratic 40% aqueous acetic acid (0.7%)- 
60% acetonitrile for 8 minutes. Lovastatin was detected 
at 238 nm to have a retention time of 6.5 minutes and was 
quantified using a calibration curve created from pure 
lovastatin samples. 

15 The results from ten individual transf ormants for 

each lovE variant are shown in standard box plot format 
in Figure 7A and 7B . Thirty individual wild- type lovE 
transf ormants and ten individual MB2143 negative control 
transf ormants were tested. Identical controls are 

20 plotted in Figures 7A and 7B. 

PCR analysis of A . terreus transf ormants 
demonstrates that greater than fifty percent of the 
transf ormants contain the transgene. Variability in 
levels of transgene expression can presumably be 

25 influenced by integration site and copy number. lovE 
variants containing identical amino acid substitutions 
are labeled. 

The amino acid and nucleic acid sequences of lovE 
variant sequences are presented in Table 5 and Table 6, 
30 respectively. 

Table 5 : Amino Acid Sequences of Variants of the lovE Gene 
lovE- 1 

maadqgif tnsvtlspvegsrtggt lprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadcrqgtldec 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelfpyvdplthalf sacttlhvgvqllreneitlgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 
hkhgmlrdlnnipp (SEQ ID NO: 41) 
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lovE-2 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntswqf ldppdsy 
dwlwtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddtssssghgsvd 
tipf f senlpigelf syvdplthalf sacttlhvgvqllreneitlgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 42) 

lovE-3 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsrvadlvsadpdpclhmssppvpsqslpldvseshssntsrqfldppdsy 
dwswisigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri It ai sell lsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sac ttlhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 43) 

lovE-4 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvgkaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri ltai sell lsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sac ttlhvgvqllreneit lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 44) 

lovE-5 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri ltai sell lsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sac ttlhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 45) 

lovE-6 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqydggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
rklf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri laai sell lsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sac ttlhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 46) : 

lovE-7 

maadqgif tnsvtlspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgaldec 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sac ttlhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 
hkhgmlrdlnsipp (SEQ ID NO: 47) 
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lovE-8 

maadqgif tnsvtlspvegsrtggt lprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqe leei 11 avtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgaldec 
Irtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvqllreneit lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnsipp (SEQ ID NO: 48) 

lovE-9 

maadqgif tnsvtl spvegsrtggt lprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgaldec 
Irtknlf tavhcyilnvriltai sell lsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnsipp (SEQ ID NO: 4 9) 

lovE-10 

maadqgif tnsvtl spvegsrtggt lprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvyserrpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgaldec 
Irtknlf tavhcyilnvril tai sell lsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnsipp (SEQ ID NO: 50) 

lovE-16 

maadqgi f mnsvtlsavegsrtsgt lprraf rrscdrchakkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgt Idee 
Irtknlf tavhcyilnvril tai sell lsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvel Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhqmlrdlnnipp (SEQ ID NO: 51) 

lovE-19 

maadqgif tnsvtl spvegshtggt lprraf rracdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrhsrasdlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgt Idee 
Irtknlf tavhcyilnvri ltai sell lsqirrtqnshmspldgsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvdsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtitvlrrsyedif slark 

hkhgmlrdlnnips (SEQ ID NO:52) 

lovE-20 

maadqgif tnsvtl spvegsrtggt Iprralrrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgt Idee 
Irtknlf tavhcyilnvri ltai sell lsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpitpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 
hkhqmlrdlnnipp (SEQ ID NO: 53) 
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lovE-21 

maadqgif tnsvt lspvegsrtggt lprralrrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhissppvpsqslpldvsdshssntsrqf ldppdsy 
dwswtsigtdeaidtncwglsqcdggf scqlestlpdlpspf estvekaplppvssdiaraasaq 
re If ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgtldec 
lrtknlftavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sactt Ihvgvqllreieitlgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkygmlrdlnnipp (SEQ ID NO: 54) 

lovE-30 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrchaqkvkctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf f nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvriltaiselllsqirrtlnshmsplegsrsqspsrddtssssghssvd 
tipf f senlpigelf syvdplthalf sactt lhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnippc (SEQ ID NO: 55) 

lovE-31 

maadqgif tnsvt lspvegsrtggt lprralrrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmsspsvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqrdggf ssqlkptlpdlpspf estvekaplppvssdiaraasaq 
relfddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri Itai sell lsqirl tqnshmsplegsrsqspnrddt ssssghssvd 
tipf f senlpigelf syvdplthalf sactt lhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 56) : 

lovE-32 

maadqgif tnsvt ispwgsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsictdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relfddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigglf syvdplthalf sactt lhvglqllreneitlgvhsaqgiaasismsgesg 
ediartgatssarceeqpttpaarvlf mf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 57) [ 

lovE-33 

maadqgif tnsvt lspvegsrtggt lprraf rrscdrcharkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldvseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf eytvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvri ltaiselllsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdplthalf sactt lhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnstrceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 58) 

lovE-34 

maadqgif tnsvt lspvegsrtggt Iprralrrscdrchaqkikctgnkevigrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmsspqypsqslsldiseshssntsrqf ldppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgtldec 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf f senlpigelf syvdplthalf sactt lhvgvqllrene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegaf qeaksagsrgrtiaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 59) . 
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lovE-36 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrkl rqsraanlvsadpdpclhms sppvpsqs lpldvseshs snt srqf Idppdsy 
dwswtsigtdeaf dtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigif f nasrrlltvlrqqaqadchqgt Idee 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsrddissssghssvd 
tipf fsenlpigelf syvdpl thai f sac ttlhvgvqllrenei tlgvhsaqgiaayisksgepg 
ediartgatnsarceeqpttpaarvlf mf Isdegaf qeaksagsrgrt iaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 60) . 

lovE-37 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrchaqkikcignkevtgrapcqrcqragl 
rcvysercpkrrlrqsraadlvsadpdpclhms sppvpsqs lpldvseshs snt srqf Idppdsy 
dwswtsigtdeai dtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgt Idee 
lrtknlf tavhcyilnvri It ai sell lsqirrtqnshmsplegsrsqspsrddtssssghscvd 
tipf fsenlpigelf syvdpl thai f sac ttlhvgvqllreyei tlgihsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf Isdegaf qeaksagsrgrt iaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 61) 

lovE-38 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrcharkikctgnkevtgrapcqrcqqagl 
rcvysercpkrkl rqsraadlvsadpdpclhms sppvpsqs lpldvseshs snt srqf Idppdsy 
dwswtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmff nasrrlltvlrqqaqadchqgt Idee 
lrtknlf tavhcyilnvri ltai sell lsqirriqnshmsplegsrsqs Is rddt ssssghssvd 
tipf f senlpide If syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgelg 
edivrtgatnsarceeqpttpaarvlfmf Isdegaf qeaksagsrsrt iaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 62) ; 

lovE-39 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrchaqkikctgnkevngrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhmssppvpsqslpldiseshs snt srqf Idppdsy 
dwswtsigideaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppissdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf fnasrrll tvlrqqaqadchqgt Idee 
lrtknlf tavhcyilnvri laai sell lsqirrtqnshmsplegsrsqsps rddt ssssghssvd 
tipf fsenlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf Isdegaf qeaksagsrgrt iaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 63) 

lovE-40 

maaeqgif tnsvt lspvegsrtggtlprraf rrscdrcharkikctgnkevtgrapcqrcqqagl 
rcvysercpkrkl rqsraadlisadpdpclhmssppvpsqslplevseshs snt srqf Idppdsy 
dwswtsigtdkaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssditraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf fnasrrll tvlrqqaqadchqgt Idee 
lrtknlf tavhcyildvriltaiselllsqirrtqnshmsplegsrsqspsrddt ssssghssvd 
tipf fsenlpigelf syvdplrhalf sac ttlhvgvqllreie it lgvhsargiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf lsdegtf qeaksagsrgrt iaalrrcyedif slark 

hkhgmlrdlnnipp (SEQ ID NO: 64) 

lovE-41 

maadqgif tnsvt lspvegsrtggtlprraf rrscdrchaqkikctgnkevtgrapcqrcqqagl 
rcvysercpkrklrqsraadlvsadpdpclhms sppvpsqs lpldvseshs snt srqf Idppdsy 
nwlwtsigtdeaidtdcwglsqcdggf scqleptlpdlpspf estvekaplppvssdiaraasaq 
relf ddlsavsqeleeillavtvewpkqeiwthpigmf fnasrrll tvlrqqaqadchqgt Idee 
lrtknlf tavhcyilnvriltaiselllsqirrtqnshmsplegsrsqspsgddt ssssghssvd 
tipf fsenlpigelf syvdpl thai f sac ttlhvgvql Irene it lgvhsaqgiaasismsgepg 
ediartgatnsarceeqpttpaarvlfmf Isdegaf qegksagsrgrt iaalrrcyedif slark 
hkhgmlrdlnnipp (SEQ ID NO: 65) ' 
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Table 6: DNA Sequences of Variants of the lovE Gene 

lovE- 1 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAGTTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCGTCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATGCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCCCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGCC AGGACAGGGGCGAC CAAT TC CGCAAGATGCGAGGAGCAGC CGAC CACT C C AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGGAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGACCTCAACAATATTCCTCCATGA (SEQ ID MO: 66) 

lovE-2 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGT CTACAGTGAGCGATGC CC CAAGCGCAAGCTACGC CAAT CC AGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCTGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTTGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAAGGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAGTGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACGGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGATATAGCC AGGACAGGGGCGAC CAATTC CGCAAGATGCGAGGAGCAGCCGAC CACTCCAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
C ACAAACATGGCATGCTC AGAGAC CTCAACAATATT CCTC CATGA (SEQ ID NO: 67) 
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lovE-3 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCAAGCTACGCCAATCCAGGGTAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGATCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCGA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AG CC AGGACAGGGG CGAC C AAT TC CG CAAGAT GC GAGGAGCAGC CGAC CACT CC AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

C ACAAACATGGC ATGC T C AGAGAC CT CAAC AATATT C CT C CATGA (SEQ ID NO: 68) 

IovK-4 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGGTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
T AGACGTAT C CGAGT CGC AT T C CT CAAAT A CC TC C C GGC AATTT CT TGAT C C AC CGGACAGCTA C 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGGAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGC C AGGACAGGGG CGAC CAAT T C CG CAAGATGCGAGGAGCAGC CGAC CACT CC AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
C ACAAACATGGC ATGCTC AGAGAC CTCAAC AATATTC CTC CATGA ( SEQ ID NO:69) 
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lovE-5 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACC CT TCTTTAGCGAGAAC CT CC CTATTGGTGAGCTGTT CT CCTATGTTGACC CC CTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGCC AGGACAGGGGCGAC CAAT T C CGCAAGATGCGAGGAG CAGC CGAC TACT C C AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGAC CTCAACAATATTC CT CCATGA (SEQ ID NO: 70) 

lovE- 6 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATATGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAAAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGGCCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATC C C AGTCGC CGAGCAGAGACGAC AC CAGCAGCAGC AGCGGC CACAGC AGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCGATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AG CCAGGACAGGGG CGAC CAAT T C CGCAAGATGCGAGGAGCAGC CGAC CACT C CAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
CACAAACATGGCATGCTCAGAGACCT CAACAATATTC CTC CATGA (SEQ ID NO: 71) 


BOSTON 1322173vl 


-63- 


lovE-1 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCT^GCTACGCCT^LATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGGTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCGCACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGATATAGCCAGGACAGGGGCGACCAATTCCGCAAGATGCGAGGAGCAGCCGACCACTCCAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

C ACAAACATGGC ATGCTCAGAGAC CT CAACAGTATT C CTC CATGA (SEQ ID NO: 72) 

lovE-S 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTGGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGTGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 

GGAGTCGATC CC AGTCGCCGAGCAGAGACGAC AC CAGCAGCAGCAGCGGC CACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AG C C AGGACAGGGGCGAC CAATT C CG CAAGATGCGAGGAGCAGC CGAC CACT CC AGC 

GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGACCTCAACAATATTCCTCCATGA (SEQ ID NO: 73) 


BOSTON 1322173vl 


-64- 


lovE- 9 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGACGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
TTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCAC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAACGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGAAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AG CC AGGACAGGGG CGAC CAATT C CGCAAGATGCGAGGAG CAGC CGAC CACT CC AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGAC CTCAACAATATT C CTC CATGA (SEQ ID NO: 74) 

lovK-10 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCAAAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
AC CATACC CTTC TTTAGCGAGAACCT C C CT ATTGGTGAGCTGTT CT CCTATGTTGAC C CC CTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGCTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGC CAGGACAGGGGCGAC CAAT T C CGCAAGATGCGAGGAGCAGC CGAC CACT C CAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
C AC AAACATGGC ATGC T CAGAGAC CT CAAC AATATT C CT C CATGA (SEQ ID NO: 75) 


BOSTON 1322173vl 


-65- 


lovE-16 

ATGGCTGCAGATCAAGGTATATTCATGAACTCGGTCACTCTCTCTGCAGTGGAGGGTTCACGCAC 
CAGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCAAAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGGTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACAGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTATCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACTAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTAGAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGCC AGGA CAGGGGCGAC CAAT T C CGCAAGATGCGAGGAGCAGC CGAC CACT C C AG C 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

C ACAAACATGGCATGCTCAGAGAC CTCAACAATATT C CTCCATGA (SEQ ID NO: 76) 

lovE- 19 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACACAC 

CGGTGGAACATTACCCCGCCGTGCATTCCGACGCGCTTGTGATCGGTGTCATGCACAAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCATTCCAGGGCATCGGATCTCGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGACG 

GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTATTCTCCTATGTTGACGCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

TTACTCTGGGAGTAGACTCCGCCCAGGGCATTGCAGCTTCGATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AGC CAGGACAGGGGCGAC CAATT C CG CAAGAT GCGAGGAGCAGC CGAC CACT C C AGC 

GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCACAGTACTGCGACGAAGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

C ACAAACATGGC AT GC T C AGAGAC CT CAAC AAT ATT C CTT CATGA (SEQ ID NO: 77) 


BOSTON 1322173vl 


-66- 


lovE- 2 0 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCACTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGAC C CAGAACAGC CATATGAGCC CACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGCC AGGACAGGGGCGACCAATTC CGCAAGATGCGAGGAGCAGCCGATCACT CCAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
CACAAACATGGCATGCTC AGAGAC CT CAACAATATTC CTC CATGA (SEQ ID NO: 78) 

lovE-21 ~~ 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCACTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATATCCTCGCCTCCAGTGCCCTCACAGAGCTTAGCGC 

TAGACGTATCCGATTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTAACTGCTGGGGGCTGTCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGTCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCTAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCGAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACC CAGAACAGC CATATGAGCC CACTGGAAG 

GGAGTCGATC CCAGTCGCCGAGCAGAGACGACAC CAGCAGCAGCAGCGGC CACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGATTGAGA 

TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AGC CAGGACAGGGGCAAC CAAT T C CGCAAGATGCGAGGAGCAGC CGAC CACT C C AG C 

GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAATATGGCATGCTC AGAGAC CT CAACAATATTC CTC CATGA (SEQ ID NO: 79) 


BOSTON 1322173vl 


-67- 


lavE- 30 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGGTCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTGGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCGTTCTGGCCGTGAC 
GGTAGAATGGCCGAAG CAGGAAAT CTGGAC C CAT C C C ATCGGAATGTTTT T C AATGCGT C ACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
C AT AT CGGAGTTGCTC CTGT CGCAAATT AGGCGGAC C CTGAACAGC CAT ATGAGC C CACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGC^GCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGATATAGCCAGGACAGGGGCGAC CAATTC CGCAAGATGCGAGGAGCAGC CGAC CACTCC AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGACCTCAACAATATTCCTCCATGA (SEQ ID NO: 80) 

lovE-31 " 

ATGGCTGCAGATCAAGGTATATTCACGAACTCCGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCATTACGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGT CTAC AGTGAGCGATGC CC CAAGCGCAAGTTACGC CAAT CCAGGGCAGCGGAT CT CGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTTCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 

ACGTGATGGAGGCTTCAGCTCTCAGTTAAAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTACTGT CGCAAATT AGGCTGAC C CAGAACAGC CATATGAGCC CACTGGAAG 

GGAGTCGATCCCAGTCGCCGAACAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AGC C AGGACAGGGGCGAC CAAT T C CG CAAGATGCGAGGAGCAGC CGAC CACT CC AGC 

GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGAC CTCAAC AATATTC CTC CATGA (SEQ ID NO: 81) 


BOSTON 1322173vl 


-68- 


lovE-32 

ATGGCTGCAGATCAAGGTATATTCACTAACTCGGTCACTATCTCGCCAGTGGTGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGTTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTTGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
C ATAT CGGAGTTGC T C CTGT CG CAAATT AGGC GGAC C C AGAACAGC CAT ATGAGC C CACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGGGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGCTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAATCAGGC 
GAGGAT AT AGC C AGGACAGGGG CGAC CAGTT C CGCAAGATGCGAGGAG CAGC CGAC C ACT C C AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
C ACAAA CATGGC ATGC T C AGAGAC CT CAAC AATATT C CT C CATGA ( SEQ ID NO : 82 ) 


lovE-33 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACGAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

ATACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGGAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTC CTGT CGCAAATTAGGCGGAC C CAGAACAGC C AT ATGAGC C CACTGGAAG 

GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AGCC AGGACAGGGG CGAC CAAT T C CACAAGATGCGAGGAG CAGC CGAC CACT C CAGC 

GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTC AGAGAC CT CAAC AATATTCCTC CATGA (SEQ ID NO: 83) 


BOSTON 1 322173v 1 


-69- 


ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTGCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTATTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTATACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCAAGTGCCCTCACAGAGCTTGTCGC 
TAGACATATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGT CGCAAATTAGGCGGAC C CAGAACAGC CATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGATATAGCCAGGACAGGGGCGACCAATTC CGCAAGATGCGAGGAGCAGC CGAC CACTCCAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCATTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGCATGCTCAGAGACCTCAACAATATTCCTCCATGA (SEQ ID NO: 84) 

lovE-3 6 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCACCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 

CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGAATCTCGT 

CTCTGCTGACCCAGATCCCTGCTTACACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTTTTGACACTGACTGCTGGGGGCTATCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGCTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

GTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATCTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAGCAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 

GGAGTCGATCCCAGTCGCCGAGCAGAGACGACATCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

T T AC T C TGGGAGTACACT C C GC CC AGGGCATT GC AGCTT ACATC AGCAAGAGCGGGGAAC CAGGC 

GAGGAT AT AGC C AGGACAGGGG CGAC CAATTC CGCAAGATGCGAGGAGCAGC CGAC CACT CC AGC 

GGCTCGGGTGTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 

CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

C ACAAACATGGC ATGCTCAGAGAC CT CAAC AATATT C CTC CATGA (SEQ ID NO: 85) 


BOSTON 1322173vl 


-70- 


lovE-31 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTATTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAACGGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAGGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACT^GCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCTTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCTCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CAT AT C GGAGTTGC T C CTGT CGCAAATT AGGCGGAC C CAGAA CAGC CATATGAG CC CACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCAGAGACGACACCAGCAGCAGCAGCGGCCACAGCTGTGTCGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGTATGAGA 
TTACTCTGGGAATACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGCC AGGACAGGGGCGAC CAATTC CGC AAGAT GC GAGGAGCAGC CGAC CACT C CAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 

CACAAACATGGC ATGCTCAGAGAT CT CAACAATATT C CTC CATGA (SEQ ID NO: 86) 

lovE-33 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 

CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACGAAAGATCA 

AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAAGCTGGACTT 

CGATGCGT CTATAGTGAGCGATGC CC CAAGCGCAAGCTACGC CAAT CCAGGGCAGCGGAT CT CGT 

CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 

TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 

GACTGGTCGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 

ATGTGATGGAGGCTTCAGCTGTCAGCTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 

CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 

CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 

GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 

GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 

TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 

CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGATC CAGAA CAGC CAT ATGAGCC CACTGGAAG 

GGAGTCGATC CCAGTCGCTGAGCAGAGACGACAC CAGCAGCAGT AGCGGC CACAGCAGTGTTGAC 

ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGATGAGCTGTTCTCCTATGTTGACCCCCTGAC 

ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 

TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACTAGGC 

GAGGAT AT AGTC AGGACAGGGGCGAC CAATTC CGCAAGATGCGAGGAGCAGC CGAC CACT CCAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAAGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
C ACAAACATGGCATGCTC AGAGAC CT CAACAAT ATT C CTC CATGA (SEQ ID NO: 87) 


BOSTON 1 322173v 1 


-71- 


lavE-39 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCACCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTAATGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCCAATCCAGGGCAGCGGATCTCGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCCCAGAGCTTGCCGC 
TAGACATATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCATTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGATATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGGCCGC 
CATATCGGAGTTGCTCCTGTCGGAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATC CCAGTCGC CGAGCAGAGACGACAC CAGCAGGAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGC C AGGACAGGGGCGAC CAAT TC CG CAAGAT GCGAGGAGCAGC CGAC CACT CC AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGCAAAGTCTGCTGGTT 
C C CGAGGT CGAAC CAT CG CAGC AC TG CGACGATG CT ATGAGGAT AT CTTT T C C C TCGC C C GC AAA 

C ACAAACATGGC ATGC T C AGAGAC CT CAAC AATATT C CT C CATGA (SEQ ID NO: 88) 

IovE-40 

ATGGCTGCAGAACAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACGAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGTGTCTACAGTGAGCGATGCCCCAAGCGCAAGCTACGCC^TCCAGGGCAGCGGATCTCAT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCGCCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGAAGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
GACTGGTCGTGGACCTCGATTGGCACTGACAAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTTGAGT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTACTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TT ACGGAC CAAGAAC C T C TT TACGGC AGTACACTGTT ACATATTGGATGT GCGGAT TT TGAC CGC 
C ATATCGGAGTTGCTC CTGT CGCAAATT AGGCGGAC C CAGAACAGC CATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGC CGAGCAGAGACGACAC CAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAG 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGATTGAGA 
TTACTCTGGGAGTACACTCCGCCCGGGGCATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 

GAGGAT AT AGC CAGGACAGGGG CGAC CAAT T C CGCAAGATGCGAGGAG CAGC CGAC CACT C CAGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGACTTTCCAGGAGGCAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCC TCGC CCGC AAA 
CACAAACATGGCATGCTCAGAGACCT CAAC AAT ATT CCTCCATGA (SEQ ID NO: 8 9) 


BOSTON 13221 73 vl 


-72- 


IovE-41 

ATGGCTGCAGATCAAGGTATATTCACGAACTCGGTCACTCTCTCGCCAGTGGAGGGTTCACGCAC 
CGGTGGAACATTACCCCGCCGTGCATTCCGACGCTCTTGTGATCGGTGTCATGCACAAAAGATCA 
AATGTACTGGAAATAAGGAGGTTACTGGCCGTGCTCCCTGTCAGCGTTGCCAGCAGGCTGGACTT 
CGATGCGT CT AC AGTGAGCGATGC CC CAAGCG CAAG CTACGC CAAT C C AGGGCAGCGGAT CT CGT 
CTCTGCTGACCCAGATCCCTGCTTGCACATGTCCTCACCTCCAGTGCCCTCACAGAGCTTGCCGC 
TAGACGTATCCGAGTCGCATTCCTCAAATACCTCCCGGCAATTTCTTGATCCACCGGACAGCTAC 
AACTGGTTGTGGACCTCGATTGGCACTGACGAGGCTATTGACACTGACTGCTGGGGGCTGTCCCA 
ATGTGATGGAGGCTTCAGCTGTCAGTTAGAGCCAACGCTGCCGGATCTACCTTCGCCCTTCGAAT 
CTACGGTTGAAAAAGCTCCGTTGCCACCGGTATCGAGCGACATTGCTCGTGCGGCCAGTGCGCAA 
CGAGAGCTTTTCGATGACCTGTCGGCGGTGTCGCAGGAACTGGAAGAGATCCTTCTGGCCGTGAC 
GGTAGAATGGCCGAAGCAGGAAATCTGGACCCATCCCATCGGAATGTTTTTCAATGCGTCACGAC 
GGCTTCTTACTGTCCTGCGCCAACAAGCGCAGGCCGACTGCCATCAAGGCACACTAGACGAATGT 
TTACGGACCAAGAACCTCTTTACGGCAGTACACTGTTACATATTGAATGTGCGGATTTTGACCGC 
CATATCGGAGTTGCTCCTGTCGCAAATTAGGCGGACCCAGAACAGCCATATGAGCCCACTGGAAG 
GGAGTCGATCCCAGTCGCCGAGCGGAGACGACACCAGCAGCAGCAGCGGCCACAGCAGTGTTGAC 
ACCATACCCTTCTTTAGCGAGAACCTCCCTATTGGTGAGCTGTTCTCCTATGTTGACCCCCTGAC 
ACACGCCCTATTCTCGGCTTGCACTACGTTACATGTTGGGGTACAATTGCTGCGTGAGAATGAGA 
TTACTCTGGGAGTACACTCCGCCCAGGGTATTGCAGCTTCCATCAGCATGAGCGGGGAACCAGGC 
GAGGAT AT AGC C AGGACAGGGG CGAC CAATT C CGCAAGATGCGAGGAGCAGC CGAC CACT C C AGC 
GGCTCGGGTTTTGTTCATGTTCTTGAGTGATGAAGGGGCTTTCCAGGAGGGAAAGTCTGCTGGTT 
CCCGAGGTCGAACCATCGCAGCACTGCGACGATGCTATGAGGATATCTTTTCCCTCGCCCGCAAA 
CACAAACATGGC ATGC TCAGAGACCT CAACAATATT C CTC CATGA (SEQ ID NO: 9 0) 


Equivalents 


Those skilled in the art will recognize, or be able 
to ascertain, using no more than routine experimentation, 
many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are 
intended to be encompasssed by the following claims. 


BOSTON 1322173vl 


-73- 


